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Introduction 


Terje Lohndal 


Norwegian University of Science and Technology 


1. From Old Norse to Zoque 


‘The present volume consists of a number of different papers that all in some way 
or other are related to the study of the human Faculty of Language. They represent 
the view that in order to understand and study the Faculty of Language, it is im- 
portant to approach it from a wide variety of perspectives. We study the syntax and 
semantics from synchronic and diachronic perspectives, we look at many different 
languages, we look at how children acquire languages, we think about how the 
distinctively human capacity for language has evolved, and we study language use 
and how it is impacted by various sociological factors. All these aspects are crucial, 
and although not everyone can study all of these, they will have to be part of an 
overall theory of human language(s). 

That has also been the view of the person in whose honor this volume appears, 
professor Jan Terje Faarlund. For the past 40 years, he has contributed to deepening 
our understanding of the Faculty of Language. In particular, he has done this by 
studying Old Norse and contemporary Norwegian and how Old Norse developed 
into what eventually became modern Norwegian. As an undergraduate, he studied 
Norwegian and French, and he got the Magister in 1974 on a dissertation surveying 
the syntax of prepositional phrases in Norwegian. He then went on to make sub- 
stantial contributions to our understanding of language change, through detailed 
studies of Old Norse, in particular (see his two books Syntactic Change: Towards a 
Theory of Historical Syntax (1990) and The Syntax of Old Norse (2004)). From a 
synchronic Norwegian grammar perspective, Faarlund’s greatest achievement has 
been to co-author the most comprehensive reference grammar of Norwegian. The 
grammar consists of 1223 pages and was published in 1997. Faarlund wrote the 
book together with his colleagues Svein Lie and Kjell Ivar Vannebo. 

In most of his work, Faarlund has adopted a Chomskyan generative grammar 
perspective, which has lead him over the past decade to spend more time on gen- 
eral linguistic questions. In particular, he has written a popular introduction to 
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the work of Noam Chomsky (2005). This is the only book about this topic in 
Norwegian, and in addition to introducing Chomsky’s philosophy of language; it 
also provides a rather detailed example of how to analyze several facts from 
Norwegian. 

In recent years, Faarlund has taken an interest in language evolution. For sev- 
eral years he co-taught a seminar on the evolution of language for MA and PhD 
students with professor of biology Erika Hagelberg. He has also made his own 
contributions to the growing field of literature on language evolution, arguing for 
a Chomskyan perspective also in this work. 

In the late 1980s Faarlund and his late wife Marianne Gullestad visited 
Chiapas in southern Mexico. His first encounter with the Zoque language was 
from reading William Wonderly’s 1951 dissertation on Copainala Zoque. Later, 
upon meeting people from Ocotepec, which is another Zoque-speaking commu- 
nity in Chiapas, he realized that their dialect was very different from Wonderly’s 
description. Other varieties differ as well, and combined with the fact that there 
are very few Zoque speakers left in Copainala, this triggered Faarlund to begin his 
work on a new description of the language. This culminated in the publication of 
a 200-page grammar of Chiapas Zoque from Oxford University Press in 2012. The 
main focus is on the empirical side, but he uses Chomskyan generative grammar 
to describe the language. 

In addition to working on grammars from a descriptive and theoretical per- 
spective, Faarlund has also spent a lot of time on the New Norwegian/Nynorsk 
variety of Norwegian and questions concerning its orthography. He was a member 
of the Norwegian Language Council for many years, and he chaired a committee 
whose work prepared the ground for the latest comprehensive reform of the 
Nynorsk variety of written Norwegian (see Royneland’s contribution in this vol- 
ume for comprehensive discussion). As a result of this work, Faarlund has also 
vigorously participated in many public debates over the years. 

The present volume honors Faarlund by presenting essays that together rep- 
resent all the areas where he has done research. The unifying theme is that they 
all in some way or other relate to the study of Universal Grammar. The majority 
of the essays adopt a generative perspective like Faarlund, though some essays 
take slightly different perspectives. This is either because they do not agree with 
the standard Chomskyan perspective (e.g., Mufwene), or because their questions 
are of a different type (Royneland, Trudgill). Sociolinguistic work is important to 
provide us with a better understanding of the boundaries of Universal Grammar, 
and due to Faarlund’s contributions in this realm, two chapters on this topic are 
included. 
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2. Outline of the chapters 


In what follows, I will briefly outline each of the chapters. The chapters have been 
organized into topical sections that very roughly reflect the development of 
Faarlund’s own research. 

Tor A. Afarli and Christer Platzack have all written papers on the Scandina- 
vian languages. 

Afarli discusses the accusative/dative alternation for spatial PPs in a 
Norwegian dialect. He presents a minimalist analysis in which the dative/locative 
type is the default version of any given alternating preposition, and that the accu- 
sative/directional version only occurs when the preposition is the head of a small 
clause predicate PP in the complement domain of the verb phrase. Afarli connects 
the accusative/directional property to a resultative head in the aspectual structure 
of the verb phrase. 

Platzack discusses the absence of overt subjects in certain finite clauses in 
Swedish. He analyzes these, from the perspective of current minimalist theory, as 
instances of spurious topic drop, which enables us to maintain the strong subject 
coercion requirement in languages like Swedish (and English). His proposal incor- 
porates recent work on A- and A-bar movement, and he suggests that topic drop 
involves deletion in SpecCP. 

In addition to the three grammatical papers, there is also a section on socio- 
linguistics, mostly drawing on data from Norwegian varieties. 

Unn Royneland discusses Norwegian language reforms in the 21st century, 
with particular attention to the two most recent committees. She discusses whether 
the change in perspective, that is, including ‘the voice from below; has had a sig- 
nificant impact on the outcome of the committees’ work. She concludes positively 
and argues that people's opinions had a real impact on the very content of the final 
reform that was effective as of August 2012. 

Peter Trudgill also focuses on Norwegian, but on the issue of gender mainte- 
nance and loss in West and North Germanic. His point of departure is that some 
varieties have retained the original three grammatical genders inherited from 
Indo-European, whereas others, such as English and Afrikaans, have lost gram- 
matical gender altogether. Others have reduced their number of grammatical gen- 
ders from three to two. Trudgill explores the phenomenon of differential feminine 
gender loss and maintenance in Germanic. He argues for a perspective that links 
social and linguistic structure in accounting for these changes. 

Jan Terje Faarlund studied French as a student, and Hans Petter Helland and 
Christine Meklenborg Salvesen contribute chapters that primarily focus on French. 

Helland discusses non-finite ‘gerundives’ (gérondif) in adjunct positions and 
how to analyze the preposition en that precedes the gerundive. He argues that en 
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should be analyzed as a prepositional complementizer. He also compares the ge- 
rundive to structures with present participles. 

The topic of Salvesen’s chapter is cartography, and more concretely the left 
periphery in Modern Germanic and Old French. She argues that Old French and 
Modern Germanic are true V2 languages, and that they activate the same position 
in the left periphery, that is, that the finite verb moves to the same head. She argues 
that whereas Old French shares syntactic properties with the Germanic languages, 
it has the pragmatic properties of the Romance languages. 

In the section on diachrony, Werner Abraham and Elly van Gelderen discuss 
facts from both the Slavic languages and the Germanic languages. 

Abraham starts from the point of view of the emergence of the analytic past in 
German, and discusses a similar construction in Polish. He considers what the 
status of the new auxiliary is in Polish, whether or not it follows the same gram- 
maticalization pattern as medieval German, and whether or not the phenomenon 
is to be traced back to contact between German and Polish. Abraham concludes 
that the new Polish periphrastic past is not a result of language contact. 

Van Gelderen’s topic is diachronic changes in pronouns and demonstratives. 
She discusses anaphoric marking in English (and briefly in Scandinavian). Her 
claim is that in Old English personal pronouns are not deictic/referential but that 
demonstrative pronouns have this function. In Middle English, the situation has 
been reversed. Van Gelderen discusses the factors behind this, and she focuses on 
internal (the use of demonstratives before nouns) and external (new personal pro- 
nouns due to language contact) factors. 

The topic of the chapters by Alice Harris and Jerrold Sadock are what lesser- 
studied languages can tell us about grammatical theory. 

Harris considers the process of metathesis in Batsbi and how it emerged. She 
claims that metathesis in Batsbi is a result of grammaticalization together with 
regular phonological processes. She builds on previous work done together with 
Jan Terje Faarlund, which argues that affixes trapped between the base and a new 
affix are lost, and argues that in Batsbi some of these trapped affixes were not im- 
mediately lost. This is the source of the variable tense marker, and its position was 
reanalyzed as metathesis. 

Sadock discusses how noun phrases without articles are interpreted as either 
definite or indefinite. He provides one case study from Old Norse and another one 
from English. He argues that the direct correlation between the presence of an 
overt article and an indication of (in)definiteness, and the absence of an article and 
the lack of such an indication cannot be maintained. 

David Lightfoot and Marit Westergaard discuss acquisition and its relevance 
for grammatical architecture. 
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Lightfoot’s topic is explanation in diachronic syntax, and how that requires 
good synchronic analyses of the initial and new I-languages. It may also require 
historical linguists to link the new I-language with particular features of the exter- 
nal language to which the innovating children were exposed. Lightfoot argues that 
this enriches our understanding of some of the idealizations of synchronic syntax, 
in particular the relationship between the Primary Linguistic Data and the gram- 
mars triggered. 

Westergaard is concerned with how to think of parametric variation from the 
perspective of language acquisition. She reviews some traditional generative work 
on parameters as well as some recent constructivist work, and discusses how chil- 
dren handle variation that they are exposed to in the input. Her case study is word 
order and she finds no evidence for parameters in language acquisition data. In- 
stead, she argues for a model of micro-cues, according to which children are sensi- 
tive to fine distinctions in syntax and information structure from early on. 

In the last section of the book, Erika Hagelberg and Salikoko Mufwene take 
us into the domain of language evolution. 

As the only non-linguist among the contributors, Hagelberg discusses what 
evolution and its research methods can tell us about how language came to be. She 
reviews a number of recent studies and explores what they might tell us, including 
studies of the physical and cognitive evolution of modern humans, the relationship 
between social complexity and brain size, the gestural theory of language, theory 
of mind, and the use of molecular genetics to identify putative language genes. 

Mufwene’s goal is rather different. He explores a different theoretical under- 
standing of language, namely language as a hybrid and complex technology. He 
argues that language evolved primarily to facilitate efficient communication in 
populations whose social structures were becoming increasingly more complex. 
From that perspective, he asks what it can tell us about the evolution of language. 
He argues that the evolutionary trajectory was complex and with several options 
and that considerations of optimality are absent. 
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Scandinavian 


On the syntax of the accusative/dative 
alternation in spatial PPs in Norwegian 
dative dialects 


Tor A. Afarli 
Norwegian University of Science and Technology (NTNU), Trondheim 


This paper investigates the systematic alternations between accusative and 
dative case that is found with spatial prepositions in Norwegian dative dialects. 
This case alternation is correlated with directional and locative interpretations 
respectively, and is a phenomenon well known from other (Germanic) 
languages. The paper argues that the dative/locative type is the default version 
of any given alternating preposition, and that the accusative/directional version 
only occurs when the preposition is the head of a small clause predicate PP 

in the complement domain of the verb phrase. In that case, the accusative/ 
directional property is imposed by a selecting resultative head in the aspectual 
structure of the verb phrase. In the absence of a resultative head, a PP may be 

a small clause predicate in the complement domain of the verb phrase and still 
retain its default locative/dative properties. Thus, it is correctly predicted that 
we find presentational sentences with both stative and activity verbs taking a 
locative/dative PP as a small clause predicate. 


1. Introduction 


The case distinction between accusative and dative is generally lost in the dialects 
of Mainland Scandinavian, but older speakers of many rural dialects in (very 
roughly) the middle part of Norway and in the western middle part of Sweden still 
use dative case on indirect objects and with a hand-full of verbs, adjectives, and 
prepositions (see e.g. Reinhammar 1973; Sandoy 2005; Eythdérsson, Johannessen, 
Laake & Afarli 2010; Afarli & Fjosne 2012; Anderson 2010). 

In this paper, I investigate the systematic accusative/dative alternations with 
spatial prepositions that is found in the Norwegian dative dialects. This case alter- 
nation is correlated with directional/locative interpretation and is a phenomenon 
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well known from other (Germanic) languages (see e.g. Zwarts 2006, van 
Riemsdijk 2009). The alternation is illustrated in the following pair, taken from my 
own dialect (the Halsa dialect in More og Romsdal county), which is the dialect I 
use as my empirical source throughout. The PP is set in bold. 


(1) a. Hainn hoppa ni baten. (directional) 
he jumped in boat.the.acc 
‘He jumped into the boat? 


b. Hainn stod ni bata. (locative) 
he stood in boat.the.DAT 
‘He was standing in the boat? 


2. A syntactic difference 


First, take a closer look at the type illustrated in (1b), where dative is used in a 
locative PP, see (2). 


(2) a. Vi undersokt pasienten pa lafta. (locative) 
we examined patient.the on room.the.DAT 
“We examined the patient in the room’ 


b. E mesta ringjen min ni siloa. (locative) 
I lost ring.the my in silo.the.paT 
‘T lost my ring in the silo’ 


c. E bruka kniven sist puinn bora. (locative) 
I used knife.the last under table.the.paT 
‘T used the knife last time under the table? 


In the postverbal DP + PP sequences in (2), Iassume that the DP is a direct object 
and the PP is an adjunct. This is supported by the fact that the PP is fully op- 
tional in each instance, the subject + verb + object complex forming a complete 
semantic unit in itself. Moreover, the DP and the PP can easily be separated by 
another adjunct, as actually illustrated in (2c), where the adverbial sist ‘last’ is 
positioned after the object but before the PP. This also supports the contention 
that the PP is an adjunct. 

Assuming that an adjoined PP is free from influence from elements inside the 
verb phrase (i.e., no head inside the verb phrase c-commands or selects an ad- 
joined PP), it seems reasonable to assume that spatial prepositions are basically 
locative and that they assign dative case by default, as stated in (3). 


(3) In Norwegian dative dialects, an alternating spatial preposition is basically 
locative and assigns dative case to its complement as a default property. 


The accusative/dative alternation in spatial PPs 
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This is in line with van Riemsdijk (2009), who maintains that the dative is the de- 
fault case in corresponding case alternating prepositions in German. 

Now, consider some examples where accusative is used in a directional PP, 
see (4). 


(4) a. Vi la pasienten pa lafte. (directional) 
we put patient-the on room.the.acc 
“We put the patient into the room: 


b. E kasta ringjen min ni silo’n. (directional) 
I threw ring-the my in silo.the.acc 
‘I threw my ring into the silo’ 


c. E gjomt kniven  puinn bore. (directional) 
I hid  knife-the under table.the.acc 
‘T hid the knife under the table’ 


Observe that the prepositions used in (4) seem to be the same prepositions 
(pd, ni, puinn) that are used in (2), even though both the interpretation of the PP 
and the case on the PP-internal DP are different in the two sets. Confronted with 
data like (2) and (4), one might ask whether there are two homonymous preposi- 
tions — one locative, one directional - with different case assignment properties, or 
whether the case differences and the locative vs. directional readings of spatial 
prepositions should be related to differences in syntactic structure. 

I will pursue the second possibility, assuming that each spatial preposition can 
have a locative or a directional reading depending on the syntactic structure it is 
situated in. Specifically, I will argue that the PPs in (4), as opposed to those in (2), 
are not adjuncts, but rather small clause predicates with the postverbal DP a small 
clause subject. Therefore, the directional PPs in (4) are c-commanded and selected 
by a head inside the verb phrase, which, I will argue, imposes a directional reading 
on the preposition and (non-default) accusative case on the DP inside the PP. 
Similar analyses are found in Fjosne (2007), Fjosne & Afarli (2007), Arsenijevic & 
Gehrke (2009), and Anderson (2010). 

Support for a small clause analysis of the relevant sequences in (4) is the fact 
that an adjunct is not as easily inserted between the DP and the PP in (4) as it is in 
the corresponding DP + adjunct PP structures in (2). Still, this type of evidence is 
not conclusive because a predicate PP in a small clause may be shifted to the right 
of an adjunct, thus blurring the data. Another piece of evidence for a small clause 
analysis is the observation that the PPs in (4) are not fully optional on the reading 
where the verbs are interpreted as resultative or directional; this is seen most clear- 
ly in (4a) where the exclusion of the PP changes the meaning of the verb to an 
accomplishment reading. Thus, on the intended directional reading, the PPs in (4) 
appear to be more or less obligatory. 
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However, crucial evidence for a small clause analysis in (4) is found when 
considering data involving anaphors. Notice first that an anaphor in Norwegian 
must, as a main rule, be contained in a predicate whose subject is the antecedent 
of the anaphor (Hellan 1988: 73-74). This is illustrated in the simple examples in 
(5) with data from several variants of Norwegian, i.e. both dative and non-dative 
variants. 


(5) a. Pasienten; [sov pa rommet sitt;] -Nynorsk written variety 
patient-the sleeps on room.the self.poss 
“The patient sleeps in his room’ 


b. Pasienten, [sover pa rommet sitt,] -Bokmal written variety 
patient-the sleepson room.the self.poss 
“The patient sleeps in his room? 


c. Pasienten, [sov pa lafte sett,] —non-dative dialect variety 
patient-the sleeps on room.the self.poss 
“The patient sleeps in his room? 


d. Pasienten, [sov pa lafta sin]  —dative dialect variety 
patient-the sleeps on room.the.paT selfposs.DAT 
“The patient sleeps in his room? 


Now, if the directional PPs in (4) can contain an anaphor that takes the postverbal 
DP as an antecedent, the postverbal DP must be a small clause subject with the PP 
as its predicate. As shown in (6), this prediction is borne out. 


(6) a. Vi la [pasienten, [pa lafte sett,]] 
we put patient-the on room.the.acc self.poss.acc 
‘We put the patient into his room: 


b. *Vi la [pasienten, [pa lafte hass.]] 
we put patient-the on room.the.acc his.poss.acc 
‘We put the patient into his room: 


This strongly suggests that the DP + PP sequences in (4) are indeed small clause 
structures. 

On the other hand, notice that an anaphor inside a locative adjunct PP, like in 
(2), cannot take a postverbal DP as antecedent, cf. (7a). In such cases, a pronomi- 
nal must be used, as in (7b).! 


1. Notice that the contrast in (7) is not equally clear in all Norwegian dialects. In particular, it 
seems that it is weaker or non-existent in dialects of Northern Norway, which may be due to a 
weakening of the predicate requirement on anaphors in those dialects. 
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(7) a. *Vi [undersokt pasienten,] [pa lafta sin,] 
we examined patient-the on room.the.paT self-poss.DAT 
“We examined the patient in his room’ 


b. Vi [undersokt pasienten,] [pa lafta hass;] 
we examined patient-the on room.the.paT his.poss.DAT 
“We examined the patient in his room’ 


I conclude from this that the directional/accusative type illustrated in (4) is a 
small clause structure, whereas the locative/dative type illustrated in (2) is a 
DO + PP-adjunct structure. In other words, the locative/dative type in (2) has a 
structure like the one in (8), whereas the directional/accusative type in (4) has a 
structure like the one in (9). 


(8) ..[V DP] PP 
(9)... [V [DP PP]] 


Taking the difference between (8) and (9) as my point of departure, I will try in the 
next section to further develop and refine my analysis in order to explain the 
accusative/dative alternation under discussion. 


3. Articulating the analysis 


Since, according to (3), the locative/dative type is the default type as far as the al- 
ternating spatial prepositions are concerned, it seems natural to assume that these 
default properties must somehow have been converted in the directional/accusative 
type. I will base the refinement of my analysis on this presumption, and I further 
assume that this conversion may only take place if the spatial PP is contained in 
the complement (c-command) domain of a relevant aspectual head in the verb 
phrase, specifically implying that it is selected by that head. 

Introducing the notion of aspectual head suggests that I will adopt a more ar- 
ticulated analysis of the verb phrase than I have done up to now, which is precisely 
what I want to do. More specifically, I want to adopt a decomposition analysis of 
the verb phrase of the type that is put forward in Ramchand (2008). 

To illustrate, (10) shows the outline of an analysis based on Ramchand (2008) 
of the sentence She broke the stick to pieces in the kitchen. In Ramchand’s system, 
the verb phrase is (maximally) composed of the aspectual heads init(iation), 
proc(ess), and res(ult), see Ramchand (2008) for further elucidation of these 
terms/notions and the analysis that is based on them. 
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(10) initP 


init procP 


break ne PP 


the stick proc’ in the kitchen 


proc resP 


<break> LF rR 


<the stick> res’ 


By 


res PP 
<break> to pieces 


Notice that the free-standing PP in the kitchen in (10) is assumed to be adjoined to 
the aspectually complex verb phrase; the precise adjunction site is not indicated 
here. Crucially, the adjoined PP is not c-commanded or selected by any of the as- 
pectual heads in the verb phrase. On the other hand, the resultative PP to pieces is 
c-commanded by all three aspectual heads. 

Consider now (2a) vs. (4a), ie. the locative/dative type vs. the 
directional/accusative type, repeated below for convenience. 


(2) a. Vi undersokt pasienten pa lafta. (locative) 
We examined patient-the on room.the.DAT 
“We examined the patient in the room? 


(4) a. Vi la pasienten pa lafte. (directional) 
we put patient-the on room.the.acc 
‘We put the patient into the room: 


I start with the locative/dative type illustrated in (2a). Given a Ramchandian de- 
composition analysis, (2a) has the partial structure shown in (11). Notice that the 
verb undersoke ‘examine’ is an activity verb which implies that the structure in- 
cludes both an init head and a proc head. 


(11) initP 
vi init’ 
init procP 
undersokt ae PP 
pasienten proc’ pa lafta (DAT) 


ei 


proc 
<undersokt> 
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Since this example does not have a resultative interpretation, there is no res head 
that selects the PP, only init and proc heads. The PP is adjoined, and since there is 
no res head, the PP must receive its default reading and case, which is locative/dative 
according to (3). 

Consider now the directional/accusative type in (4a). In contrast to (2a), (4a) 
indeed has a resultative interpretation, which presupposes the presence of a res 
head. Furthermore, the PP is a small clause predicate, cf. (9), which I take to mean 
that the PP is the complement of and selected by the res head (i.e., resP = small 
clause). This imposes a directional reading on the basically locative preposition. 
Moreover, a locative preposition converted to a directional preposition in this way 
assigns accusative case. The partial structure is given in (12). 


(12) initP 
vi init’ 
init procP 
la 
pasienten proc’ 
proc resP 
<la> seer 
<pasienten> res’ 
res PP 


<la> pa lafte (ACC) 


Conclusion: (2a) and (4a) do not, strictly speaking, contain two different homony- 
mous prepositions. Instead, what happens is that the given spatial preposition gets 
a locative/dative interpretation or a directional/accusative interpretation, depend- 
ing on the structural position of the PP. Specifically, if the PP is c-commanded and 
selected by a res head, the P is interpreted as a directional/accusative P. The rele- 
vant structures are investigated in more detail in the next section. 

Notice for the sake of completeness that a resultative structure with a 
directional/accusative PP may very well contain a locative/dative PP in addition to 
the directional/accusative PP. This is illustrated in (13) 


(13) Vi la pasienten pa lafte nisa husa 
we put patient-the on room.the.acc in that.paT house.the.DAT 
‘We put the patient into the room in that house? 


Here, the directional/accusative PP is analysed as in (12), and the locative/dative 
PP is adjoined to the structure, like in (11). Thus, the preposition ni ‘in’ gets its 
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default reading, since, being positioned in an adjoined position, it is out of reach 
from selection by the res head. 


4. The structure of spatial PPs 


I have concluded that alternating spatial prepositions may change their semantic 
and case nature dependening on the syntactic environment of the PP. What does 
this change consist in, more concretely? 

It is often assumed (see e.g. Jackendoff 1983, Svenonius 2010, among others) 
that complex directional or path prepositions incorporate a locational or place ele- 
ment. For instance, the PP into the lake may be analysed as shown in (14), with 
subsequent incorporation of <place> into <path>, thus forming into. 


(14) en 


<path> 
to 


<place> DP 
in the lake 


To illustrate further, consider the locational/directional ambiguity shown by PPs 
with a verb like jump, see (15). 


(15) John jumped in the lake. 


(15) may mean either that John was situated in the lake while jumping up and 
down (locational reading), or it could mean that John jumped into the lake 
(directional reading), see Tungseth (2008) for an investigation of corresponding 
locational/directional ambiguities based on Norwegian data. 

Now, in the locational interpretation, the PP is is a simple <place> PP with a 
structure like (16). 


(16) ae 


<place> DP 
in the lake 


In the directional meaning, the PP has a structure like (17), with an abstract <path> 
head c-commanding the <place> head. 


(17) wear 


<path> 
a 


<place> DP 
in the lake 
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In this reading, the preposition could also have been into, in which case the ab- 
stract path head would have been lexicalized as to, as shown in (14). The simple 
point made here is that a PP may contain an abstract <path> layer that is not lexi- 
calized, and the addition of this layer seems to be quite flexible, as it were, so that 
we have pairs of homonymous prepositions where one member of the pair is a 
simple <place> preposition, whereas the other member has an abstract <path> 
layer added. Thus, the change or conversion from a locative to a directional read- 
ing is here analysed as the addition of a <path> layer on top of a <place> layer in 
the PP. My earlier analysis assumed that a directional reading is triggered by selec- 
tion of the PP by a res head. Now, this analysis can be sharpened. What is going on 
is that a selecting res head triggers the addition of a <path> layer in the PP, thus 
giving rise to the directional reading. 

Applying this analysis to the jump in the lake example, jump in the lake with a 
locational reading has the structure shown in (18), whereas jump in the lake with 
a directional reading has the structure shown in (19), where that path layer is im- 
posed from the res head in (19). 


(18) [DP [init [DP [proc]]]] [pp P<place> DP] 

(19) [DP [init [DP [proc [DP [res [,, P<path>+<place> DP]]]]]]] 
Similarly for the accusative/dative alternation structures in the dative dialects. The 
locative/dative prepositions are simple <place> prepositions, whereas the 
directional/accusative prepositions have an addded <path> layer triggered by the 
selecting res head. The rules for case assignment, are now stated as in (20) and 
(21). 

(20) P<place> assigns DAT 

(21) P<path>+<place> assigns ACC 


We can now give more detailed versions of (11) and (12), with the structure of the 
PPs made more explicit, see (22) and (23), respectively. 


(22) initP 
vi init’ 
init procP 
undersokt Bee PP 
pasienten proc’ eT 
oe ae P<place> DP 
pia pa lafta (DAT) 


<undersokt> 
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(23) initP 
vi init’ 


init procP 


la er 


pasienten proc’ 


ae Nee 


proc resP 


<la> BOs. 


<pasienten> res 


res pp 
<la> 


P<path><place> DP 
pa lafte (ACC) 


5. A potential problem: Directional dative? 


The analysis of the alternating prepositions in Norwegian dative dialects as it is 
developed thus far implies that dative is always correlated with a locative interpre- 
tation and accusative is always correlated with a directional interpretation. Thus, 
dative is never correlated with a directional interpretation, and accusative is never 
correlated with a locative interpretation.” Are these predictions true? As for accu- 
satives with a locative interpretation, they may be encountered in the dative 
dialects, but they should most probably be explained as instances of the general 
weakening of the use of dative in these dialects. Generally, the trend is that the da- 
tive is giving way to the accusative (especially among younger generations) so that 
the accusative is used both with directional and locative interpretations. 

However, this weakening effect is of course irrelevant when the dative is used, 
because in that case it is reason to believe that the regular alternating accusative/dative 
system is maintained, and then a locative interpretation is clearly expected when the 
dative is used. Still, it has been argued on the basis of German data that a directional 
dative in fact exists, contrary to expectation, cf. Smith (1995: 304ff.), where the notion 
of ‘end-point focus’ is invoked to explain such unexpected data. That is, the idea is that 
normally an end-point implies directionality and therefore accusative, but if the end- 
point is especially focused, this focus can license the used of dative on the DP. 

Translated to the Norwegian dative dialect under investigation, corresponding 
data seems to be found in Norwegian, too. (Notice that German has a general 


2. Notice that these predictions only apply to the alternating spatial prepositions. There are also 
non-alternating prepositions of a spatial nature that impose a particular fixed case, e.g. always 
accusative or always dative. These non-alternating prepositions are not investigated in this paper. 
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case-alternation system with spatial prepositions which is very similar to that 
found in Norwegian dative dialects.) Consider the data in (24). 


(24) a. Hainn festa lampa_ e takje. (directional) 
he attached lamp-the in ceiling.the.acc 
‘He attached the lamp to the ceiling’ 


b. Hainn festa lampa- e takja. (locative?, directional?) 


he attached lamp-the in ceiling.the.paT 
‘He attached the lamp in/to the ceiling’ 


(24a) is as expected; the interpretation is directional and the case is accusative. The 
structure is shown in (25). 


(25) initP 


init procP 


festa Ler 


lampa proc’ 


Bo 


proc resP 


<feesta> BER. 


<lampa> res’ 
res pp 
<faesta> 
P<path><place> DP 
e takje (ACC) 
The problem is (24b), where the dative is used. Since the DP is in the dative, the 
structure should be as shown in (26), and the interpretation should be locative. 


(26) initP 


init procP 
festa Be ae Pp 
lampa proc’ LaF PS 
Be ee P<place> DP 
proc e takja (DAT) 
<faesta> 


20 


Tor A. Afarli 


A paraphrase of this meaning could be something like "he attached the lamp and 
that happened in the ceiling”. That is, the attachment action took place in the ceil- 
ing. Still, this does not appear to fully convey the meaning here, since there is 
clearly a sense that even (24b) has a directional meaning even with the dative, ice. 
that the lamp was attached to the ceiling. 

Still, I tentatively suggest that the structure of (24b) is in fact (26), and that the 
structural meaning is locational, not directional, i.e. no res head is involved. To the 
extent that (24b) may have a directional meaning, I assume that is a so-called 
shadow interpretation (cf. Hoekstra 1988) which results from the way we usually 
interpret particular situation types. Thus, the shadow interpretation is not read off 
from the linguistic structure. Thus, if an act of attaching a lamp takes place, and 
this is located to the ceiling, the natural interpretation of the situation is that the 
lamp was attachet to the ceiling, which is a directional shadow interpretation, even 
though the strict structural interpretation is locational. In line with this, I surmise 
that Smith’s (1995) notion of end-point focus licensing dative is really a matter of 
a directional shadow interpretation imposed on a locative adjunction structure. 

The mode of explanation used here is similar to the use of the notion of shadow 
interpretation in Hoekstra (1988) to explain why small clause subject DPs 
sometimes are interpreted as direct arguments of the matrix verb, cf. (27). 


(27) a. Eg farga [haret — gult] 
I dyed hair-the yellow 
‘I dyed my hair yellow: 


b. Eg at [kjoleskapet tomt] 
I ate refrigerator-the empty 
‘T ate the fridge empty: 


In (27a), in contrast to (27b), the small clause DP seems to bear a direct argument 
relation to the matrix verb. However, this is not a structurally based interpretation, 
but rather it is a shadow interpretation, because situations of dyeing so that the 
hair becomes yellow naturally involves dyeing the hair as well. Structurally the 
small clause subject DP and the matrix verb bear the same structural relation in 
(27a) and (27b), ie. there is no direct structural argument relation between the 
matrix verb and the small clause subject in neither of the two cases. 

The reasoning in this section is supported by a consideration of additional 
data. My main claim is that (24b) may have a directional interpretation, but that 
interpretation can only be a shadow interpretation. Structurally, (24b) contains a 
direct object DP and an adjunct PP, and crucially, the postverbal DP and the PP 
do not constitute a small clause. Therefore, it is predicted that the PP cannot con- 
tain an anaphor taking the postverbal DP as its antecedent, even with the 
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directional interpretation. This prediction is in fact borne out, as shown by the 
contrast in (28).? 


(28) a. Vi festa lampa, pa plassen sin, (directional) 
we attached lamp-the on place.the.acc self-poss.acc 
“We attached the lamp to its position’ 
b. ‘Vi festa lampa, pa plassa sin, (locative?, directional?) 
we attached lamp-the on place.the.par self.poss.DAT 
“We attached the lamp in/to its position’ 


Clearly, if (28b) were a directional dative with a small clause structure, (28b) 
should have been fully grammatical. The fact that is is not, and that there is a rela- 
tively marked contrast between (28a) and (28b), indicates that only (28a) has a 
small clause structure, and that the structure of (28b) is as claimed above. 


6. Alternating prepositions in presentational structures 


Presentational structures are very common with unaccusative type verbs in 
Norwegian. This section investigates the analysis of alternating prepositions in pre- 
sentational constructions. Consider first some examples where unaccusative type 
verbs are used with alternating prepositions in non-presentational structures. 


(29) a. Hainn Ola ma ga _ pa Iafte. (directional) 
he Ola must walk on room.the.acc 
‘Ola must walk to the room? 


b. Hainn Ola sitt pa lafta. (locative) 
he Ola sits on room.the.DAaT 
‘Ola is sitting in the room? 


(30) a. Hainn Ola fer ni hagainn. (directional) 
he Ola go in garden.the.acc 
‘Ola walks to the garden: 


b. Hainn Ola er ni haga. (locative) 
he Ola is in garden.the.DaT 
‘Ola is in the garden’ 


(31) a. Hainn Ola kryp  puinn sengja si. (directional) 
he Ola creeps under bed.the.acc self:poss.acc 
‘Ola creeps under his bed? 


3. Notice that the contrast in (28) is quite weak. For reasons that are unknown to me, data 
involving anaphors are often murky, see e.g. Busterud (2010) on this general problem in an L2 
setting, see also Lodrup (2008). 
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b. Hainn Ola ligg puinn sengjenn _ sin. (locative) 
he Ola lies under bed.the.par self.poss.paT 
‘Ola is lying under his bed? 


In what follows, I will use the pair in (29) as a representative example, but what I 
say also applies to the other pairs, mutatis mutandis. 

Since the verbs used in (29) are unaccusative type verbs, they can be used in 
presentational structures, as expected in Norwegian, cf. (32). 


(32) a. Detma ga enkar pa lafte. (directional) 
it must walk a manon room.the.acc 
‘A man must walk to the room: 


b. Det sitt en kar pa lafta. (locative) 
it sits a manon room.the.paT 
‘A man is sitting in the room’ 


I will assume that the postverbal DP + PP sequence in both (32a) and (32b) con- 
stitutes a small clause, as indicated in (33), cf. Hoekstra (1988); evidence for this 
will be given presently. 


(33) a. Detma ga _ [zn kar [pa lafte]] (directional SC) 
it must walk a man on room.the.acc 
‘A man must walk to the room: 


b. Det sitt [en kar [pa lafta]] (locative SC) 
it sits a man on room.the.DAT 
‘A man is sitting in the room’ 


On the assumption that both the a-version and the b-version contain small claus- 
es, the analysis of (29a, b) is as follows: The subject DP in both (29a) and (29b) is 
raised to the matrix subject position from the SC subject position in underlying 
structures similar to those in (33). Notice that hainn Ola in (29) is substituted for 
en kar ‘a mam in (32)/(33) because of the definiteness effect which must be ob- 
served in presentational constructions. 

Is there any evidence for the small clause analysis in (32)/(33)? Consider the 
data involving anaphors in (34). 


(34) a. Detma ga _ [en kar, [pa lafte sett,]] (directional SC) 
it must walk a man on room.the.acc self.poss.Acc 
‘A man must walk to his room: 


b. Det sitt [an_kar, [pa lafta sin,]] (locative SC) 
it sits a man on room.the.parT self.poss.DAT 
‘A man is sitting in his room? 
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In both (34a) and (34b), an anaphor contained in the PP can take the postverbal 
DP as its antecedent, which constitutes evidence for the small clause analysis ac- 
cording to my previous reasoning, see Section 2. 

Given the reasoning above, notice carefully that these data are unlike those 
discussed previously, since here both the directional/accusative type and the 
locative/dative type contain small clauses, which means that the relevant PP is in 
the complement domain of the verb phrase in both types, and therefore the P is 
c-commanded by any head inside the verb phrase in both types, too. Now, given 
the analysis defended in Sections 2 and 3 where the small clause analysis is corre- 
lated with the directional/accusative type only, the crucial question seems to be the 
following. If both types involve a small clause structure in the presentational case 
(as well as in its “moved” version), why is there a directional/accusative vs. 
locative/dative difference in these cases? 

Theexplanationis quite straightforward. lassume that the directional/accusative 
versions contain verbs or have meanings that imply that there is a res head in the 
verb phrase, thus explaining both the directional meaning an the accusative case, 
cf. (21). On the other hand, the locative/dative versions contain stative verbs, cf. 
Dowty (1979: 67), Hale & Keyser (2005: 34ff.). According to Ramchand (2008: 33f., 
55f.), the verb phrase of stative verbs contains only an initP. Therefore, it is reason- 
able to assume that (33a)/(34a) has the structure in (35), whereas (33b)/(34b) has 
the structure in (36). 


(35) [DP [init [DP [proc [DP [res [,. DP PP]}]]]]]] 
(36) [DP [init [,. DP PP]]] 


In (36), there is no res head that can possibly impose the directional/accusative 
properties, even though the PP is a small clause predicate in the complement do- 
main of the verb phrase. In other words, examples with a structure like (36) have a 
locative reading (with the default dative case), even though they contain a small 
clause. Thus, the existence of a small clause structure is not a sufficient condition 
for the directional/accusative type, since a res head must also c-command and se- 
lect the PP in order to trigger the directional/accusative type. In (35), on the other 
hand, the PP is in the complement domain of a res head, a configuration that im- 
poses the directional/accusative property in the usual way. 

The discussion above looked at presentational pairs where the b-version is a 
stative verb. As we saw, the stative verb has just an init-projection in Ramchand’s 
system. There are also corresponding pairs where the b-version contains an activ- 
ity verb, see (37). (38) shows the corresponding presentational versions. 


(37) a. Hainn gar pa lafte. (directional) 
he walks on room.the.acc 
‘He walks to the room? 
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b. Hainn gar pa Iafta. (locative) 
he walks on room.the.DAT 
‘He is walking in the room: 


(38) a. Det gar enkar pa lafte. (directional) 
it walks a manon room.the.acc 
‘A man walks to the room? 


b. Det gar enkar pa lafta. (locative) 
it walks a manon room.the.DAT 
‘A man is walking in the room? 


In (38), the possibility that an anaphor in the PP takes the postverbal DP as its 
antecedent will indicate that we are faced with postverbal small clause configura- 
tions in both cases, see (39). 


(39) a. Det gar enkar, pa lafte sett,. (directional) 
it walks a man on room.the.acc self.poss.acc 
‘A man walks to his room: 


b. Det gar enkar, pa lafta sin,. (locative) 
it walks a manon room.the.par self.poss.DAT 


‘A man is walking in his room? 


We are now faced with the same problem as before. Both versions have a small 
clause structure where the preposition is c-commanded by aspectual heads in the 
verb phrase, but still there is a directional/accusative vs. locative/dative difference. 
As the reader may have guessed, the explanation follows the same logic as before. 
Even though the verb in (37b)/(39b) is not a stative verb, it still lacks a res head. 
Specifically, since it is a pure activity verb it has an init head and a proc head, but 
no res head. Therefore, a directional reading (and accusative case) cannot be 
imposed in the b-version. The a-version, on the other hand, has a selecting res 
head, and a directional reading is imposed. The two structures are sketched in 
(40)-(41). 


(40) [DP [init [DP [proc [DP [res PP]]]]]] 

(41) [DP [init [DP [proc PP]]]] 
The essential point here is that even though the PP in (41) is c-commenaded by 
both init and proc, there is no res head that can impose a directional reading and 


accusative case. Thus, both the stative presentational cases and the activity presen- 
tational cases are explained in the same way. 
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7. Conclusion 


The main points made in this paper are the following: 


- Alternating spatial PPs are locative/dative or directional/accusative. 

- The locative/dative version is the default version. 

- The locative/dative and directional/accusative versions are often embedded in 
different syntactic structures, where a locative/dative PP is an adjunct with the 
default properties and the directional/accusative PP is a small clause predicate 
in the complement domain of the verb phrase. 

- In the latter case, the directional/accusative properties are imposed by a se- 
lecting res head in the aspectual structure of the verb phrase, which presup- 
pose that the PP is a small clause predicate in the local c-command domain of 
the res head. 

- Inthe absence of a res head, a PP may be a small clause predicate in the com- 
plement domain of the verb phrase and still retain its default locative/dative 
properties. Thus, it is correctly predicted that we find presentational sentences 
with both stative and activity verbs taking a locative/dative PP as a small clause 
predicate. 
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Spurious topic drop in Swedish* 


Christer Platzack 
Lund University 


Like English, Swedish is a language with obligatory subjects in finite clauses. 
One of the exceptions to the rule that the subject must be pronounced is found 
when the subject is Topic dropped from sentence initial position, resulting in a 
V1 declarative with no visible subject. Since Topic drop in Swedish also involves 
dropping from first position of objects and complements of prepositions, it 
cannot be analyzed as a subject property per se. In this paper I will argue that 
null subjects may be due to a specific kind of Topic drop, here called spurious 
Topic drop, since deletion does not effect the left most element of the clause. As 
I will show, there are a number of instances in Swedish that can be subsumed 
under the rubric spurious Topic drop. In all cases, a subject is missing despite the 
fact that it is not in the leftmost position. As will be shown, spurious Topic drop 
is also found in relative clauses and in subject-first main clauses. 


1. Introduction 


I will argue that most cases (except ellipsis in coordination) with a non-pronounced 
subject in Swedish are instances of Topic drop, ie. deletion of a weak subject 
(referential weak pronoun or expletive) at the left periphery. See e.g. Haegeman 
(1990) and Sigurdsson (2011); see also Mornsj6 (2002). The subject may be topic 
dropped in Spec-CP (1b) but not in Spec-TP (1c), This example is based on Cole 
(2010: 285). The non-pronounced subject is indicated as “O”. 


* This paper has its origin in a discussion with Elisabet Engdahl concerning her papers Engdahl 
(2010, in press). Preliminary versions of my paper have been presented at Grammar in Focus in 
Lund, February 2011, and at a workshop about Uralic and Scandinavian languages, Lund August 
2011. Thanks to theaudiencies for valuable comments, anda special “thank you” to Elisabet Engdahl, 
Gunlég Josefsson, Terje Lohndal, Henrik Rosenkvist and Halldér Armann Sigurdsson for com- 
ments on a preliminary version. I take full responsibility for remaining errors. 
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(1) a. Varje morgon beséker John konstmuseet. Vad gor 
every morning visits John art-museum.DEF what does 
han pa kvallen? 
he in evening.DEF 
‘Every morning John visits the art museum. What does he do in the 
evening?’ 
b. Han/@ beséker universitetet 
he visits university. DEF 
‘He visits the university: 


c. Pa kvallen besdker han / *@ universitetet. 
in evening.DEF visits he university.DEF 
‘In the evening he visits the university’ 


The claim that Topic drop must involve Spec-CP seems to be violated by examples 
like (2), where an expletive subject appears to have been deleted in Spec-TP, to the 
right of V in the V2 position: 


(2) Pahotellrummet var (det) smutsigt. 
in hotelroom.DEF was it dirty 
‘It was dirty in the hotel room’ 


In my paper I will argue that (2) is a specific kind of Topic drop, here called spurious 
Topic drop, since deletion does not effect the left most element of the clause, but like 
ordinary Topic drop the involved position is a low Spec-CP. There are several similar 
cases in Swedish, that can be analyzed as spurious Topic drop, and I will ultimately 
claim that all main clauses with a stressed subject in first position may be candidates 
as well. In all cases, a subject may be missing despite the fact that it is not sentence 
initial. I will claim that the initial, stressed element in such cases (e.g. the PP in (2)) 
is in a high Spec-CP doubled by the dropped element in the low Spec-CP, hence a 
kind of hidden Left Dislocation. I will remain agnostic about whether or not the 
high Spec-CP can be further specified as one of the positions offered by the 
Cartographic version of the Minimalist framework, see Rizzi (1997) and Cinque and 
Rizzi (2010). I will also abstain from discussing the possibility that what I call the 
low Spec-CP is actually an edge-feature of T, see Frey (2010) and Section 2.3 below. 

My paper is organized in the following way. In Section 2 I present Chomsky’s 
(2008) account of A-chains and A-bar chains in terms of parallel movements, and 
show the relevance of this for Topic drop. I also present a Left Dislocation like pos- 
sibility in Swedish (see Faarlund (1980) and Eide (2011) for a discussion of similar 
data in Norwegian) to double an initial, stressed element with a weak pronoun or 
frame topic, assumed to be in low Spec-CP, and claim that Topic drop is the dele- 
tion of this weak element. Spurious Topic drop occurs when topic drop takes place 
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from the low Spec-CP in the presence of a high Spec-CP.! Sections 3 to 8 illustrate 
a number of cases that can be described in terms of spurious Topic drop. Section 9 
is a summary and a conclusion. 


2. Topic drop 


2.1. Introduction 


Repeating what I said in the Introduction, Topic drop is deletion (under recover- 
ability) of a weak element which heads an A-bar chain in Spec-CP. So far we have 
looked only at examples where the dropped element is a subject, but it is well 
known, as illustrated by Mornsj6 (2002) for Swedish, that also objects may be topic 
dropped. Mornsj6 (2002) also found examples in her spoken Swedish corpus 
where the dropped element was an indirect object, the object of a preposition, an 
expletive, a quasi-argument or a frame topic like hdr ‘here, dar ‘there, da ‘then, nu 
‘now’ and the adjunctive sa ‘so, then: See Sigurdsson (2011) for a recent discussion 
of possible and impossible contexts for Swedish Topic drop, and an attempt to ac- 
count for these restrictions in terms of intervention effects. 


2.2 Parallel movement 


The restriction that a topic dropped element may be left out only when it is in 
Spec-CP, see (1b/c) above, can be accounted for in terms of Chomsky (2008), mak- 
ing use of his suggestion that A/A-bar chains are formed by parallel movements. 
According to Chomsky, T inherits its features from C and forces the subject to 
move to Spec-TP, creating an A-chain,, and the Edge-feature of C forces the sub- 
ject to move to Spec-CP, creating an A-bar chain. In a sentence like (3a) with the 
structure (3b), both subject movement to Spec-TP and wh-movement to Spec-CP 
are indicated by parallel arrows. 


(3) a. Who saw him? 
b. Who, [, [ whe, Ly Lp whe [vp saw him]]]]]] 
ar 


In Chomsky’s words, “[t]he @-features of C seek the goal who and raise it to SPEC-T 
(by means we still have to determine), and the edge-feature of C can also seek the 
goal who and raise it in parallel to SPEC-C”. 

Since there is parallel movement, it follows that there is no direct connection 
between who in Spec-CP and who in Spec-TP. Instead, there is one A-chain and 


1. Cf. Teleman et al (1999, Vol. IV, pp. 417-418 and Chapter 36). 
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one A-bar chain. As seen in (3a), only one of the wh-words is pronounced, but 
since the finite verb in English stays low, we cannot say for sure if the pronounced 
who is in Spec-CP or Spec-TP. Chomsky (2008) states that the pronounced entity 
is who, in Spec-CP, and I will follow this suggestion here.” In the corresponding 
Swedish case, V2 guarantees that the tensed verb is in C, hence the pronounced 
wh-word must be the one in Spec-CP, as is seen in (4): 


(4) a. Vem sag honom? 
who saw him 
“Who saw him?’ 


b. [.pvem, [, sag] [,p vera T [,, vem v [p sag honom]]]] 


Notice that this parallel movement has the important consequence that a weak 
element in Spec-TP cannot be deleted. It is the pronounced element in Spec-CP 
that is deleted by Topic drop, nothing else. Hence in an ungrammatical case like 
the choise of @ in (1c), no parallel chain is present, and attempts to delete the sub- 
ject in Spec-TP leads to ungrammaticality. 

Applying Chomsky’s parallel movement hypothesis to examples like (1b), 
i.e. the case with subject topic drop, here repeated as (5a), the result is the following 
structure (5b), where dashed arrows indicate the parallel A- and A-bar chains. 


(5) a. Han /@ besokte universitetet 
he visited university.DEF 
‘He visited the university: 


s 


CP 


) 


C 
ao 
C TP 
besdkte ee Or 


DP T’ 
han ee 
% ap vP 
_ % ee 
g ‘ DP Vv 
eS | He oe 
han oy vP 
besbkte 
Vv DP 


besékte —_ universitetet 


2. Inthe case the first wh-word is an object, as in (i), it is obvious that the pronounced wh-word 
is in Spec-CP: 


(i) Who did he see? 
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In the case with Topic drop, Spec-CP is not pronounced, although we will assume 
with Mérnsj6 (2002) that the subject is fronted to this position in syntax, and that 
the deletion takes place in PE This deletion is indicated in (5b) by a stroke through 
han in Spec-CP; the other elements stroken through are copies. The same analysis 
is assumed for cases with fronted and deleted Frame topics, see the examples in 
(6), taken from Mérnsj6 (2002: 104,106). 


(6) a. Vadgordu nar bilen inte vill startag — Da/@ sparkar 
what do you when car.DEF not want start then — kick 
jag pa den. 

I at it 
“What do you do when the car does not start? — I kick it? 


b. Johan stod i liftkén. Sa/@ kom en kille och kérde. 
Johan stood in liftline.DEF so came a guy and ran 
pa han 
into him 
Johan was standing in the lift line. Then a guy came and ran into him: 


We do not have to take a stand whether the frame topic is moved to Spec-CP or is 
first merged in this position. The important thing is that such an element, like 
personal pronouns and expletives, may be deleted in Spec-CP. 


2.3. Two Spec-CPs in Swedish 


Prosodically, a Swedish main clause may be introduced by a stressed or a 
non-stressed constituent, preceding the finite verb. When the first element is 
stressed, it may, but must not, be separated from the verb by a prosodic break. In 
(7), stress is indicated by capital letters. 


(7) a. JOHAN/Johan har kopt en ny cykel. 
John has bought a__ new bicycle 
John has bought a new bike: 
b. DEN/ Den stallde hani kéket.? 
that it put he in kitchen.pEF 
‘He put it in the kitchen’ 


3. Frey (2010) suggests that there are two ways in which the prefield may be filled in German, 
by A-bar movement or by Formal movement, which is movement of the highest element of the 
sentence to the edge of the middle field, i.e. TP in my account. Stress is only associated with 
A-bar movement. This hypothesis cannot be directly taken over in an analysis of Swedish, how- 
ever, since a weak unstressed object pronoun may be fronted without being the highest element 
of the sentence, see (7b). 
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When a full DP or a stressed pronoun is in first position, it may be followed by a 
co-referent unstressed pronoun (underlined), but not the other way around (8a). 
Similarly, a sentence initial adverb may be followed by an unstressed adjunctive sd, 
as in (8b), but not vice versa: 


(8) a. CYKELN/Cykeln/ DEN, den stallde hani  koket. 
bike.DEF it it put he in kitchen.pEF 


b. Sedan sa kommer min syster. 
then so comes my _ sister 
“Then my sister comes: 


I will take (8) to indicate that there are two Spec-CP:s in a Swedish main clause, 
where the high/first specifier contains a first merged, usually stressed, element, 
and the low/second specifier is a position for an internally merged or a first merged 
unstressed element. Taking silence to be the ultimate stresslessness, I will further 
assume, as mentioned above, that the topic dropped element targets the low 
Spec-CP. Example (8a) with doubling den has the structure in (9): 


(9) CP 
DP CP 
CYKELN ae ee 
DP C’ 
den ee ee 
C TP 
stallde ge 
DP T 
han See 
T vP 
han-stallde-den i koket 


As Henrik Rosenkvist (p.c.) has remarked, the DP in the high Spec-CP seems to 
be outside the clause proper, which begins with the low CP. See Teleman et al 
(1999, Volume IV, Chapter 36). In line with this, I will claim that there is no syn- 
tactic relation between the high Spec-CP and the low CP, only semantic/textual 
relations.* 


4. According to Eide (2011), what I call the high Spec-CP is the first position in the Copy Left 
Dislocation construction, where there is no prosodic break between the initial phrase and the 
copy. When there is a prosodic break, we have a Hanging Topic Left Dislocation, see Grohmann 
(2000: 140), and also Faarlund (1980). Both these Spec-CPs are higher than the low Spec-CP 
where Topic Drop is taking place. In the discussion below I will not distinguish between the Hang- 
ing Topic Left Dislocation and Copy Left Dislocation; both will be identified as the high Spec-CP. 
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2.4. Proposal 


Summarizing, I propose that Topic drop takes place from the low Spec-CP. What I 
call spurious Topic drop is the case where the low Spec-CP is dropped in the pres- 
ence of a pronounced high Spec-CP. In Sections 3 to 8 below I will present some 
Swedish constructions where spurious Topic drop seems to be operative. 


3. Initial locative and invisible subject in Swedish 


So far we have hypothesized that the subject is obligatorily visible in Swedish, except 
when topic dropped.° Examples like (10a) are surprising from this perspective, since 
here the subject seems to be missing in Spec-TP, granting that the fronted PP is in 
the low Spec-CP. Our hypothesis predicts, contrary to facts, that only (10b), where 
an expletive pronoun is present in Spec-TP, should be well-formed: 


(10) a. I graset kan finnas ormar. 
in grass.DEF may be snakes 
There might be snakes in the grass. 


b. I graset kan det finnas ormar. 
in grass.DEF may it be snakes 
“There may be snakes in the grass: 


Cases like (10a) have been discussed in depth by Falk (1993: 284ff.) and has recently 
been mentioned by Engdahl (2010). According to Falk (1993: 284), the subject posi- 
tion (Spec-TP) shares index with the fronted locative and may therefore remain emp- 
ty.° Falk’s analysis has for a long time been the only available one of this construction, 
even if it is not fully understood why index sharing licenses an empty subject position. 
Besides, indices are no longer part of the theoretical framework, and we might there- 
fore look for an alternative to Falk’s proposal. I will present such an alternative below, 
claiming that the mechanism behind this type of null-subject sentences in Swedish is 
the same as the mechanism that accounts for Topic drop, building on my hypothesis 
that there are two Spec-CP:s in the left hand part of the Swedish main clause. 

It should be noticed that my analysis below does not answer the question why 
this type of null subject mainly occurs with fronted locatives; compare (10) with 


5. When topic is dropped, the subject is not invisible qua subject, but as a consequence of be- 
ing an unstressed element in the lower Spec-CP. 


6. Falk (1993: 284): “[T]he subject position is empty and licensed by the topicalized phrase, 
but at the same time connected with the VP-internal argument”. 
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(11), where tense adverbials have been fronted: (11b) is more or less acceptable, 
whereas (11a) is out, at least in my grammar: 


(11) a. Pa sommaren’ kan det/??@ finnas ormar i graset. 
in summer.DEF may it be snakes in grass.DEF 
‘In the summer there may be snakes in the grass: 


b. I tre timmar har det/?@ pagatt ett m6dte _pa torget. 
for three hours has it taken place a meeting in square.DEF 
‘For three hours, a meeting has taken place in the square? 


Falk (1993) suggests that the expletive may be invisible just in case there is a sec- 
ondary predication in the structure to which the fronted element is linked, and 
this is the case with locative adverbials, but only occasionally with temporal adver- 
bials and other VP adverbials; see e.g. Falk (1993: 286) for some examples. For this 
part of the problem, I have no alternative solution. 

We will now turn to my account of the invisible expletive in cases like (10a). I 
will claim that the expletive in (10a) is invisible for the same reason as the subject 
may be invisible in (5): it has been deleted after being merged in the low Spec-CP, 
see the structure in (12) below. As we saw in connection with (5), an element in the 
low Spec-CP may be left out, yielding (10a) with the structure (12). The main 
structural difference between (5) Topic drop and (12) spurious Topic drop is the 
presence in (12) of a high Spec-CP. 


(12) CP 
PP CP 
i graset ae 
DP Cc’ 
det wr ee. 
Se ee TP 
‘\ kan ee 
pp 1 
‘\ det AAO 
sy T vP 
ae Bee 
\\. DP v 
Sy det aaa et 
Vv VP 


Independent support for the analysis in (12) is provided by the following observa- 
tion. There is an asymmetry between (10a) and (10b), not observed by Falk (1993) 
nor by Engdahl (2010): only the case with visible expletive, ie. (10b), can be 
constructed with adjunctive sd, hence (13b) is well-formed, but not (13a): 
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(13) a. *I graset sa kan  finnas ormar. 
in grass.DEF so might be snakes 
“There might be snakes in the grass: 


b. I graset sa kan det finnas ormar. 
in grass.DEF so might it be snakes 
“There might be snakes in the grass: 


Adjunctive sé is found in all Scandinavian languages except in Icelandic, and it has 
a correspondence in German but not in English, see Ekerot (1988: 230). It is typi- 
cally used immediately in front of the finite V2-verb in main clauses, and follows 
an adjunct which “comments the utterance by stating under what circumstances 
something is done, or holds, or when it takes place, or whom or what it concerns 
etc.” (SAOB S, 15835; translation in Nordstrém (2010: 38)). Ekerot (1988: 57) ar- 
gues that adjunctive sd is an expletive in the preverbal position, “that is to say a 
grammatical morpheme of the same type as the expletive subject det ‘it, there’ 
Both morphemes are unstressed, lack semantic content and function as support to 
the word order” (translation in Nordstrém (2010: 49)). This analysis is congruent 
with the hypothesis argued for above, that there are two Spec-CP:s, the high one 
being first merged in this position and only related to the rest of the sentence as 
part of its context.” Furthermore, sd is always unstressed, hence presumably in the 
low Spec-CP. See Eide (2011) for similar properties of Norwegian sd. 

Given the structure in (12) with double Spec-CP:s, and the assumption that sd 
is in the low Spec-Cf, it is obvious that sa is prevented from occurring in (13a) as a 
consequence of there being an instance of the expletive subject in the low 
Spec-CP; remember that the subject cannot be invisible unless it is moved in paral- 
lel both to Spec-TP and Spec-CP. Hence a prerequisite for an invisible expletive 
subject is the availability of the low Spec-CP. Thus, either the low Spec-CP is filled 
by sd, and there is a visible expletive subject in Spec-TP, as in (13b), or there is paral- 
lel movement of the expletive to Spec-TP and to the low Spec-CP, including deletion 
of the expletive. In the second case there is as predicted no place for sd, see (13a).° 


7. However, as Nordstrom (2010, Section 3) shows, sd is not restricted to occur after adjuncts 
but may also (marginally) be preceded by elements like VP adverbials and prepositional objects 
“clearly more integrated in the clause nucleus than adjuncts, since the choice of preposition is 
not free, but governed by the valency of the predicate. These facts are arguments against the idea 
that the first element is adjoined to the clause and that sd is an expletive in the preverbal posi- 
tion. Rather it seems that the position to the left of sd isa clause-internal one to which [unbounded 
constituents and also] certain bounded constituents can be moved.” Nevertheless, I will insist on 
the analysis that a phrase in the high Spec-CP is first merged in this position. 


8. ‘This analysis suggests an answer to the question why only locative phrases may combine with 
an empty expletive; compare (10) and (11). This restriction would follow if only locatives in the 
high Spec-CP may take an empty low Spec-CP, whereas other fronted VP-adverbials cannot tar- 
get the high Spec-CP unless there is a visible or invisible sé or frame topic in the low Spec-CP. 
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Thus, the account in terms of spurious Topic drop that has been presented 
here has at least two advantages over Falk’s analysis: it does not call for any other 
tools than already present in the account of Topic drop (no licensing as a result of 
co-indexing, for instance), and it offers an immediate explanation of the observa- 
tion that adjunctive sd cannot be used in examples like (13a). 


4. The Engdahl observation 


Engdahl (2010) observes another case of null-subject in Swedish, illustrated by the 
example in (14), where parantheses around the second det ‘it’ indicate that it may 
be unpronounced. 


(14) Det var (det) bra att du sa. 
that was it good that you said 
‘Tt was good that you said that? 


Note that the first det in (14) corresponds to the object of the embedded clause, 
and that there may be no visible subject in the matrix. If our hypothesis is cor- 
rect that except for certain cases with coordination, an invisible subject can only 
occur as an instance of Topic drop, then an analysis in terms of spurious Topic 
drop seems called for. Such an analysis will state that the initial det ‘it’ is first 
merged in the high Spec-CP. This det is doubled by a det in the low Spec-CP, 
heading an A-bar chain that has been constructed in parallel with an A-chain, 
likewise headed by det.? This is the structural prerequisite for spurious Topic 
drop: the det in the low Spec-CP constitutes the pronounced part of the two 
parallel chains, and as a weak element in the low Spec-CP, it may be dropped, 
giving the impression of a null subject. The structure of (14) is presented in 
(15):1° 


9. Due to EPP in T, there must be an A-chain connecting det in Spec.TP and Spec-vP. 


10. As Engdahl (in press) remarks, we do not expect the unbounded dependency between the 
initial det and the embedded object gap to be semantic in nature, as it must be if initial det is first 
merged in the high Spec-CP. To avoid this problem, I will claim that the invisible object must 
move to Spec-CP of the embedded clause (escape hatch), providing an ordinary syntactic bind- 
ing of the gap. The invisible binder, in its turn, is now in a position where it may be identified 
semantically by the initial det. This analysis is not illustrated in the tree diagrams. 


Spurious topic drop in Swedish 37 


(15) CP 
DP CP 
det ae 
DP Cc’ 
det 
. in ee 
XC TP 
‘\var pee 
2 DP aly 
\ det 
Sy die 
: oe T vP 
Oy. P Vv 
\ det 
Vv AP 
Mae bra [., att du sa des] 


In the absense of spurious Topic drop we must have a visible expletive: 


(16) Det var det bra att du sa. 
that was it good that you said 


In this case there are no parallel chains based on the expletive. The extracted object 
of the embedded clause has been moved to the low Spec-CP of the matrix clause 
in this case, preventing parallel movement of the matrix subject and hence making 
spurious Topic drop impossible. 

Most examples of type (14) that Engdahl (2010) found in her corpus have det 
in first position. However, as Engdahl remarks (2010: 99), this is not necessary, as 
examples (17a,b) show. Notice that the predicate adjective in these examples may 
not agree with the fronted phrase, see (17c); agreement indicates an invisible sub- 
ject dom in Spec-TP, but that would violate the restriction that Swedish can only 
have invisible subjects that are Topic dropped. 


(17) a. Honom ar latt att roa. 
him is easy to enyoy 
‘It is easy to make him glad’ 
b. Dom var langesen jag sag (Engdahl 2010, ex. (47a)) 
them was long-time-since I saw 
‘T haven't seen them for a long time? 


c. *Dom var _ lustiga att jag sag. (cf. Engdahl 2010, ex. (48a)) 
they were funny.3RpD PL that I saw 


Examples like (17a,b) have the analysis presented in (18): 
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(18) CP 
DP CP 
honom Pa 
DP Cc 
det 
4 ae 
se € TP 
‘\ ar ee eS 
BP T’ 
\ det 
Sy po 
x — T vP 
SDP Vv 
\ det ger Pe 
v AP 
ax latt [.p att roa henena] 


Now consider the ungrammatical (17c). In this case the adjective agrees with the 
fronted element dom, which presumably means that dom must have been in 
Spec-vP of the matrix clause. But this is not an A-bar position, hence this position 
is not available for an A-bar moved element like dom in (17c). 

Alternatively, dom could be first merged in Spec-vP and hence being the sub- 
ject of the matrix clause. In this case, dom cannot also be the complement of sdég in 
the embedded clause. Interestingly, however, if we change the final clause in (17c) 
into an infinitival phrase, we get a grammatical result: 


(19) Dom var  lustiga attse dom 
they were funny.3rd pL to see them 


This is a case of Tough movement, not available if the moved element starts in a 
finite domain, as we saw in (17c). A possible structure for the Tough Movement 
case is outlined in (20), where the structure contains parallel chains: 


(20) [<p dom var [,, dem T [,, dens var lustiga [,, att [) se dena] ]]]] 
ny 


As (20) indicates, we seem to have another kind of parallel A- and A-bar chains 
here than in the previous cases, where both chains have their foot in the same ele- 
ment. Nevertheless, spurious Topic drop seems to be operative in (20) as well, in- 
dicating that it does not matter where the chain starts, given that the head of the A 
chain and the A-bar chain are identical. Whether or not this idea is fruitful to use 
in the description of Tough Movement will be explored in a separate paper. For a 
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recent discussion of Tough movement, see Rezac (2006). See also Engdahl (in press) 
for a deeper discussion of missing subjects in such contexts. 

If the low Spec-CP is not available for dom, e.g. as a result of there being a 
question operator in this position, no parallel chains are possible, and hence no 
spurious Topic drop; the result is that the subject dom must be visible: 


(21) Var dom lustiga attse dem? 
were they funny.3RD PL to see them 


The unpronounced second dom gets its interpretation from the higher dom 
through movement to Spec-CP of the infinitival clause. 


5. Clause anticipating pronoun 


Another case of spurious Topic drop is found in examples like (22a), where the 
matrix clause subject is spuriously Topic dropped. The corresponding case with- 
out spurious Topic drop is given in (22b). Semantically, the matrix subject det in 
cases like (22) anticipates the that-clause in final position, as is clearly seen in 
(22c). The corresponding cases with question matrix clause are given in (22d) and 
(22e). The structure of (22a) is given as (22f): 


(22) a. Att du sa_ sa var bra. 
that you said so was good 
“That you said so, it was good: 


b. Att du sa sa det var bra. 
that you said so it was good 
“That you said so, it was good: 


c. Det var bra att du sa_ sa. 
it was good that you said so 
“That you said so, it was good? 

d. Var det bra att du sa_ sa? 
was it good that you said so 
“Was it good that you said so?’ 

e. Att du sa sa var det bra? 
that you said so was it good 
“That you said so, was that good?’ 
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f. CP 
DP CP 
[cp att du sa a 
DP C’ 
det 
. we Oe, 
SC TP 
S.var go 
“OP T’ 
det 
af ee 
SON, T vP 
“SDP v 
\ det 
Vv AP 
var bra [,, at+du-sa-sa] 


When the that-clause is in the high Spec-CP, the anticipating det cannot be real- 
ized in Spec-TP, further illustrated in (23):!! 


(23) a. Att du sa_ sa var (*det) bra. 
that you said so was it good 


b. Att vargen ar skjuten beklagades  (*det) av alla. 
that wolf.pEF is shot _was-regretted it by all 
“That the wolf is shot was regretted by everyone’ 


The that-clause must be in the high Spec-CP, since it is not weak enough to qualify 
for the low Spec-CP. The low Spec-CP is filled by anticipating det, moved in paral- 
lel to Spec-TP and to the low Spec-CP. In this position it may be spuriously topic 
dropped, or it may be pronounced, see (22b). Since there is parallel movement, 
Spec-TP must always be invisible, This accounts for the fact that unlike the 
spuriously Topic dropped subjects in the other cases we look at, the drop of det is 
obligatory in cases like (23). Notice that there is no spurious Topic drop in the 
question cases (22d,e); in these cases, there is presumably a question operator in 
the low Spec-CP, preventing parallel movement. In (22e), there is a strong 
comma-intonation after the initial that-clause. 

Falk (1993: 102fn) has observed that in examples like (24), where the matrix 
verb is a non-factive passive, drop of det is optional: 


(24) Att vargen 4r skjuten sas (det) pa radion _igar. 
that wolf.DEF is shot _was-said it onradio.DEF yesterday 
“That the wolf was shot was said on radio yesterday: 


11. The exception is when the matrix clause is a yes/no question, as shown in (22e). 
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I will not dig deeper into the difference between (23) and (24); for the purpose of 
this paper it is enough to note that the fronting of an anticipated clause leads to 
structures where spurious Topic drop may take place. 


6. Quantifier scope and expletives 


In this section we will look at another case that seems to involve spurious Topic 
drop. Compare the examples in (25): 


(25) a. Det kom inte manga pa métet. *many > not, not > many 
it came not many to meeting.DEF 
“There didn't appear many at the meeting’ 


b. Manga kom det inte pa métet. *many > not, not > many 
many came it not to meeting.DEF 
“There didn't appear many at the meeting’ 


c. Manga kom inte pa motet. many > not, not > many 
many came not to meeting.DEF 
‘Many didn’t come to the meeting?/“There didn’t come many to the 
meeting: 


As observed by Sundman (1980: 130-131), the examples with an expletive subject 
det, i.e. (25a,b), both have the single meaning that few people attended the meet- 
ing. Hence, not takes scope over many even in the case where many is in sentence 
initial position: the interpretation indicates restructuring at LE This reading is also 
available in (25c), which lacks an overt expletive. (25c) is ambiguous, however, 
also having the reading that there were a lot of people that did not attend the meet- 
ing, thus a reading where many takes scope over not. 

The difference between the two readings of (25c) seems to correspond to dif- 
ferent restructuring: either many is interpreted in Spec-TP,” preceding the nega- 
tion and thus having the reading many > not, or many is interpreted in its base 
position within vP, given the interpretation not > many (as in (25a) and (25b)). 

Notice that the interpretation not > many of (25c) seems to presuppose that the 
subject position Spec-TP is not available for manga ‘many’ in this case, hence this 
interpretation seems to violate the strong subject requirement in Swedish. However, 
an alternative would be to assume that this reading of (25c) is the result of spurious 
Topic drop: in the not > many reading of (25c), there is an expletive subject, just as 
in (25a,b), but in (25c) this expletive subject is spuriously Topic dropped. 


12. Note that there are parallel A and A-bar chains in (25c), both headed by manga ‘many’. See 
(27) below. Since manga ‘many’ heads a chain in Spec-CP also in (25b), I take for granted that 
the many > not interpretation of (25c) has nothing to do with Spec-CP. 
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If it is correct to analyze (25c) as a case of spurious Topic drop when it has the 
reading not > many, Spec-TP in this case must be filled by an invisible expletive 
subject, not the quantifier manga. If so, subject-verb inversion should not be avail- 
able with the not > many reading. As (26) shows, this is a correct prediction: when 
manga is inverted with the verb, we strongly prefer the reading that there was a 
great number of people not attending the meeting. 


(26) a. Darfor kom manga inte pa motet. many > not, *not > many 
therefore came many not to meeting.DEF 
“Therefore many did not come to the meeting’ 


b. Kom manga inte pa motet? many > not, *not > many 
came many not to meeting.DEF 
‘Did many not come to the meeting?’ 


The two readings of (25c) correspond to the following structures: 


(27) a. CP many > not 


manga C’ 


Be 
m TP 


| 

ko 

ee oe 

| 2 - 

1 manga T 

| 

a Se 

em NegP 

| NK 

ae ; wee 
| ‘inte vP 
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hat fee exe ee \ mangakem pa métet 


ay inte vP 
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b CP not > many 
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The structure with spurious Topic drop is the one in (27b). 


7. Split topicalization 


The next type of spurious Topic drop that I will discuss is found when the subject 
DP is split and pronounced both in the left periphery and within vP, a case of so 
called Split topicalization. Both subjects and objects can be split, see (28a) and 
(28b). This construction is described among others by van Hoof (2005) and 
Molnar & Winkler (2010), see also Platzack (2012a). The non-split counterparts 
are presented in (29). Stress is indicated by capital letters in (28). In (29), stress is 
not indicated. 


(28) a. BILAR har han MANGA. (Split object) 
cars has he many 
‘Cars he has many. 


b. STUDENTER finns MANGA i Lund (Split subject) 
students are many  inLund 
“There are many students in Lund? 


c. BILAR har han MANGA men CYKLAR (har han) bara EN. 
cars has he many but bikes has he only one 
‘Cars he has many, but bikes only one: 


(29) 


> 


Manga bilar har han. 
many cars has he 
“Many cars he has’ 


b. Manga studenter finns i Lund. 
many students are in Lund 
“There are many students in Lund? 


c. En cykel har han bara, men manga bilar. 
one bike has he only but many cars 
‘He has only one bike, but many cars’ 


In line with what I have said above, stress on the initial element indicates that it is 
first merged in the high Spec-CP. Notice also that (28c) supports the first merge 
analysis: the initial part cyklar ‘bikes’ is in the plural, but the numeral en ‘one’ in 
front of the non-initial part is only compatible with the singular form cykel ‘bike’; 
cf. (29c). Hence, no movement targeting Spec-CP has taken place. 

Examples of the type illustrated in (28b) are candidates for spurious Topic 
drop. At the first glance, the split topic seems to be (part of) the subject of the 
clause, even if we cannot directly tell, since first position is not a reliable subject 
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indicator in Swedish. A clear diagnostics is appearance in the position immedi- 
ately to the right of the finite verb in direct yes-no-questions, i.e. Spec-TP. As 
shown in (30a), the first part of the split topic cannot be in this position, whereas 
the unsplit subject can (30b). In the unsplit case, Spec-TP can be filled by an exple- 
tive (30c). This is also an option in the Split Topic case, as seen in (30d). 


(30) a. *Finns studenter manga i Lund? 
are students many in Lund 


b. Finns manga studenteri Lund? 
are many students in Lund 
‘Are there many students in Lund?’ 


c. Finns det manga studenteri Lund? 
are there many students in Lund 
‘Are there many students in Lund?’ 


d. Studenter finns det manga i Lund. 
students are it many in Lund 
“There are many students in Lund? 


The lack of inversion in (30a) indicates that the subject position in Spec-TP is oc- 
cupied by an invisible element. Presumably, this element is an expletive, as indi- 
cated in (30d). For this expletive to be unpronounced, as in (28b), we must have 
parallel movements of an expletive, to Spec-TP and to the low Spec-CP. As we have 
seen several times already, this lays the ground for spurious Topic drop. Such an 
analysis would account for (28b). 


8. Additional cases with spurious topic drop 


In this section I will mention two further cases where spurious Topic drop may be 
at hand, relative clauses where the subject has been relativized, and subject initial 
main clauses. 


8.1 Relative clauses 


If something like spurious Topic drop exists, it is presumably operative in more 
cases than I have looked at so far. When spurious Topic drop has applied, we have 
a finite clause without a pronounced subject, seemingly violating the strict obliga- 
tory subject condition in Swedish. Since relative clauses where the subject is rela- 
tivized fulfill this requirement, they are candidates for having been formed with 
the help of spurious Topic drop. Consider the relative clause in (31): 
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(31) ..pojken som 6ppnade fonstret 
boy.DEF that opened window.DEF 


The head of the relative clause, in this case pojken ‘boy.DEF; is outside the relative 
clause proper. This is introduced by the relative complementizer som ‘that.REL} 
presumably in C. There is no visible subject in the relative clause, although we 
know that there is an invisible one, co-referent with the head of the relative clause. 
Various ways have been suggested to account for the presence of an invisible 
subject in subject relative clauses in languages like English and Swedish, that lack null 
subjects. Since Kayne (1994: 91), the standard solution is to assume that the relative 
clause is the complement of D and that the head of the relative clause is moved to 
Spec-CP, as illustrated in (32) for the sequence the man that arrived yesterday: 


(32) DP 
D CP 
the ws: 
man gee 
that TP 


man arrived man yesterday 


Whereas Kayne’s analysis has several virtues, it also faces problems, as observed 
e.g. by Wilder (1995), Borsley (1997) and Platzack (2000), among others. The main 
difference between Platzack (2000) and Kayne (1994) concerns the status of the 
relative head. According to Kayne (1994), the relative head is raised from a posi- 
tion inside the clause to Spec-CP, whereas Platzack (2000) assumes that it is first 
merged as an N taking the relative clause as its complement. This is illustrated in 
(33) for the same example as (32).° 


(33) DP 


that TP 
saan arrived man yesterday 


I will assume that (33) is the correct structure. If so, the absence of a visible subject 
in subject relatives can be accounted for in terms of spurious Topic drop: notice 


13. According to Kayne (1994: 155, footnote 17), the structures in (32) and (33) are the only 
ones for relative clauses that are compatible with the antisymmetric hypothesis. 
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that in (33) invisible man has moved in parallel to (the low) Spec-CP and to 
Spec-TP, which is a prerequisite for spurious Topic drop to take plase. 

Note finally that no parallel movement takes place in object relatives. One 
example is given in (34), where the arrow indicates an A-bar chain between 
(the low) Spec-CP and the first merged position of the object. In Spec-CP, the ob- 
ject is deleted, presumably due to the same mechanism as in ordinary Topic drop. 


(34) pojken [pp Benes som [,, Lisa T[,,, Lisa gillade honem]]] 
boy.pEF him that Lisa Lisa liked him 


tT 


The object of the relative clause is assumed to be a personal (resumptive)'* pro- 
noun honom ‘him, with the same reference as the relative head.!° 


8.2 Subject initial main clauses 


Finally, we will take a look at a potential case of spurious Topic drop where there are 
no traces at the surface that this operation has applied. On the other hand, given the 
availability of spurious Topic drop, it is hard to see how grammar could prevent this 
possibility. Ordinary subject initial main clauses constitute such a case. 

As soon as an element is first merged as the high Spec-CP, it may be doubled 
in the low Spec-CP, paving the way for spurious Topic drop. I will here illustrate 
this option with ordinary main clauses introduced by a definite subject, showing 
how they might be candidates for a spurious Topic drop analysis. 

Consider the sentence in (35): 


(35) Mannen 6ppnade dérren. 
man.DEF opened door.DEF 
“The man opened the door? 


Given the technical machinery introduced above, (35) is either analyzed as in 
(36a) or (36b); we will disregard stress in our discussion here. 


(36) a. [cp Mannen [.,6ppnade] [_, mannen T |, mannen-dppnade dérren ]]]] 
t —— 
b.  [.p Mannen [,, ban [, 6ppnade] [,, han T [,, han-Gppnade dérren ]]]] 


t as, 


14. Note that the pronoun may be visible: 


(i) pojken, honom som Lisa gillade 
boy.DEF him that Lisa liked 
15. As long as this pronoun does not appear as a resumptive pronoun, it is presumably lacking 
phonetic features. 
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Notice that we have parallel A and A-bar chains in both cases, headed by mannen 
‘the mar’ in (36a), and by han ‘he’ in (36b). As we have seen, in such a situation the 
element in Spec-TP cannot be pronounced (*mannen dppnade mannen dorren ‘the 
man opened the man the door’ etc.), whereas the element in the low Spec-CP is 
either (spuriously) topic dropped, as in (37a,b) or there is no spurious Topic drop 
as in (38), where we have an instance of Left dislocation, either Copy Left 
Dislocation, when there is no prosodic break between the initial DP and the pro- 
noun, or Hanging Topic Left Dislocation when there is such a prosodic break: 


(37) a. Mannen 6ppnade dorren. 
man.DEF opened door.DEF 
“The man opened the door: 


b. (Vad gjorde mannen?) @ 6ppnade dorren. 
what did man.DEF opened door.DEF 
‘What did the mando? — Opened the door’? 


(38) Mannen (#) han 6ppnade dorren. 
man.DEF he opened door.DEF 
“The man, he opened the door’ 


A consequence of this description is that an ordinary subject initial main clause 
like (35) is structurally ambiguous, with the high Spec-CP activated in one of its 
analyses. 


9. Summary and conclusion 


In this paper I have proposed an analysis of Topic drop in Swedish, based on 
Chomsky’s (2008) suggestion that A- and A-bar movement take place in parallell: 
in a case like a subject first main clause, the subject is the head both of an A-chain 
and an A-bar chain, and it is pronounced as the head of the A-bar chain, 
presumably in Spec-CP. It is suggested that Topic drop is deletion in Spec-CP. 

Observing that a topicalized element in Swedish may be either stressed or un- 
stressed, and that there are cases where a stressed version of an element immedi- 
ately precedes the unstressed version of it, but not vice versa, it is suggested that 
Swedish has two Spec-CPs, where the high one may host stressed elements and full 
phrases of various types, whereas the low one is restricted to un-stressed DPs, 
pronouns and adverbs. ‘The position that is unexpressed in the Topic drop case is 
identified as the low Spec-CP. 

Although Swedish, like English, usually must have a visible subject in main 
clauses (Topic drop and coordination ellipsis not counted), other cases have 
occasionally been observed where the subject may be empty. One case is 
caracteristically introduced by a locative adverbial, see (39): 
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(39) I graset kan det/@ finnas ormar. 
in grass.DEF may it/® be snakes 
“There may be snakes in the grass’ 


In spoken Swedish, a fronted adverbial is often followed by an adjunctive sd, as in 
(40); in many cases with V1 declaratives, there seems to be an unpronounced ad- 
junctive sd in front of the finite verb. 


(40) I graset sakan *(det) finnas ormar. 
in grass.DEF simay it be snakes 
“There may be snakes in the grass: 


Observing that adjunctive sd is not possible when the expletive is unpronounced, 
I argue that adjunctive sd and Topic dropped expletive compete for the same posi- 
tion, namely the low Spec-CP, hence the empty subject version of (39) is an in- 
stance of spurious Topic drop. Similarly, it can be argued that constructions like 
(41), recently brought into the debate by Engdahl (2010), are also cases of spurious 
Topic drop: 


(41) Det var bra att du sa 
that was good that you said 
‘It was good that you said so? 


I have further argued that we have spurious Topic drop in the following cases: 


(42) a. Att du sa sa var bra. 
that you said so was good 
“That you said so was good: 


b. Manga kom inte pa motet. (reading: not > many) 
many came not on meeting.DEF 
“Many did not appear at the meeting’ 


c. STUDENTER finns MANGA i Lund. 
students are many  inLund 
“There are many students in Lund? 

d. JOHAN 6ppnade dorren. 


Johan opened door.DEF 
Johan opened the door: 


@ 


..pojken som 6ppnade dérren 
boy.pEF that.REL opened door.DEF 
‘.. the boy that opened the door’ 
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If spurious Topic drop exists,!¢ then this type of invisible subjects is expected to be 
found in many places in the language, not only in cases like (40), (41), (42a), (42c) 
and (42e), where it is obvious that a visible subject is lacking. The observation that 
also ordinary subject initial main clauses are structurally ambiguous, where one 
possibility is the presence of hidden spurious Topic drop, raises the question if there 
are pragmatic differences corresponding to the different analyses, or if the ambigu- 
ity is just a theoretical quirk. Future research will hopefully provide an answer. 
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Norwegian language reforms in the 21st century 
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This paper considers the most recent developments in Norwegian language 
policy and planning, focusing in particular on the latest Nynorsk language 
reform proposal (2011). A key objective of this language reform was to include 
“the voice from below” (e.g. Linn 2010) in all stages of the process in order to 
ensure democratic legitimacy and user acceptance of the proposed reforms. It 
is thus important to ask whether, and to what extent, this intention has actually 
been realized. The argument made here is that the answer significantly depends 
on the extent to which the openness of the process is traceable in the content of 
the proposal. 


1. Introduction 


Since Einar Haugens seminal work on language conflict and language planning in 
Norway (1966), Norwegian language policy and planning has been the subject of 
a number of books and articles, most recently Linn (2010). During the 20th 
Century both written standards of Norwegian went through a number of reforms 
- some of them more successful than others. Until the second half of the century 
the overarching aim was to amalgamate the two standards. This policy, however, 
was never accepted by the Norwegian public and has long since been abandoned. 

In this paper I discuss the most recent developments of Norwegian language 
policy and planning, focusing specifically on the two most recently appointed 
Nynorsk language reform committees and the destiny of their reform proposals 
(2002/2003 and 2011). The 2002 reform proposal met with significant resistance in 
the pre-submission round of consultation, with the result that key revisions were 
made. The proposal was finally submitted to the Ministry of Culture in 2003. How- 
ever, the late changes (mentioned below) directly challenged the explicitly stated 
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wishes of the Ministry, and hence the proposal was eventually rejected by the 
Ministry in 2005. When a new committee was appointed in 2009, important 
changes were made with respect to the composition of the committee, the proce- 
dure, and to the mandate. The intention was to increase the legitimacy of the pro- 
posal by including what Linn (2010) calls “the voice from below” in all stages of 
the reform process. As the committee has concluded its work, it is essential to as- 
sess the extent to which this intention has been realized. A further issue is wheth- 
er integration of the “voice from below” in the reform process has significantly 
affected the content of the reform proposal. As deputy leader of this committee, I 
cannot, of course, claim to provide a disinterested evaluation. However, as a par- 
ticipant in the process, I have come to the view that the inclusion of lay people in 
the committee and the manner in which the reform process was conducted, in- 
deed has strengthened the legitimacy of the proposal and, hence, reduced resis- 
tance against it. Clearly, the compromises and the give-and-take resolutions that 
made a consensus proposal possible may be unsatisfactory from committed, lan- 
guage-ideological points of view. Indeed, this is borne out by recent critical reac- 
tions to the proposal by various engaged individuals and smaller groups. However, 
the large majority of major user groups have come out in favour of the proposal as 
a whole. A decisive factor in securing such broad acceptance is that the open, dia- 
logical nature of the process and the integration of the “voice from below” in the 
work of the committee in fact had a real impact on the very content of the pro- 
posal. At least, that will be the burden of my argument in what follows. 


2. Historical background 


For readers unfamiliar with the Norwegian language situation a brief historical 
account might be appropriate. (For more comprehensive discussions and accounts 
see Bucken-Knapp 2003, Faarlund 2003, Haugen 1966, Jahr 2003, 2007, Vikor 
1975). Since 1885 Norway has had two written standards, both referred to as 
‘Norwegian, which all Norwegians are required to learn: Nynorsk and Bokmal 
(previously Landsmdl and Riksmal/Dano-Norwegian). The minority standard, 
Nynorsk, was established by the poet and linguist Ivar Aasen in the middle of the 
19th Century, and had a wide variety of rural Norwegian dialects as its founding 
base. Bokmal, on the other hand, originates from written Danish, but was gradu- 
ally Norwegianized during the 19th and particularly 20th Century through a se- 
ries of language reforms. The model of the early Norweganizing attempts was the 
Dano-Norwegian variety spoken by the urban upper middle classes — the so-called 
‘educated daily speech. Later a wider range of spoken, particularly eastern 
Norwegian, varieties were included (cf. Jahr 2003). The very goal of the Nynorsk 
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project, as put forward by Ivar Aasen in his early writings concerning the ideological 
foundations of the new Norwegian language, was to establish a written norm based 
on a wide range of Norwegian dialects: 


Det er ikke min Hensigt hermed at fremheve nogen enkelt af vore Dialekter; nei 
ingen saadan bor veere Hovedsprog, men dette skulde veere en Sammenligning af, et 
Grundlag for dem alle. (Aasen 1836). 

It is not my intention to highlight any one of the different dialects; no single dia- 
lect should be the main language, but it should be a comparison of and a basis for 
all of them. (Aasen 1836, my translation). 


It is, however, no secret that Aasen was not particularly fond of the urban 
Norwegian dialect, as these were perceived as contaminated by Danish 
(cf. Meehlum & Royneland 2011). Moreover, Aasen did not wish to establish a 
wide norm permitting significant variation. On the contrary, his aim was a tight, 
invariant norm: “Sprogformen bor kun veere een” “the language form should be 
one only” (Aasen 1958: 298). Hence, Aasen’s goal was that “the standard language 
should represent the underlying system common to the dialects rather than indi- 
vidual forms from each dialect” (Faarlund 2003: 315). The idea that using a diver- 
sity of dialects as the basis of codification should entail actual representation of 
different dialect forms was put forward by subsequent language policy makers 
and planners. As we will see below, the wide range of variable forms characteristic 
of the Norwegian written standards were introduced gradually by the different 
language reforms of the 20th Century. Today, we may find quite opposing views 
and opposing ideological stances within the Nynorsk community with respect to 
the principles informing an ideal norm. The different views run along two main 
axes; the traditionalist view versus the convergence (towards Bokmal) view on the 
one hand, and the freedom of choice (or pro-dialect) view versus the restrictive 
norm view on the other (cf. Vikor 2010). Needless to say, there are differences of 
degree along both dimensions - ranging from fairly categorical to more moderate 
and pragmatic views. 


3. Background for the 2011-proposal 


On April 1st 2011, after a year and four months of intensive work, the Nynorsk 
reform committee submitted its proposal, Ny Rettskriving for 2000-talet (henceforth 
the 2011 proposal), to the Norwegian Language Council. Only a short while after, 
on May 20th 2011, the executive committee of the Norwegian Language Council 
unanimously accepted the proposal and handed it over to the implementing au- 
thority, the Ministry of Culture, for examination and final approval. September 
20th the same year the new norm was accepted and will be implemented from 
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August 1st 2012 (henceforth the 2012 norm).' This is the first major revision of 
Nynorsk since 1959. However, the Norwegian Language Council initiated the re- 
form process already in 1996. The last couple of years before the turn of the 
millennium two separate expert committees of the Language Council worked on 
comprehensive orthographic reforms for both Bokmal and Nynorsk. Jan Terje 
Faarlund was the leader of the Nynorsk expert committee (1996-2000) and a driv- 
ing force in the reform process. In 2000, after approximately three years of work, 
the Nynorsk committee was presented with an altered mandate and the committee 
restruck (for details see Fretland 2007). In 2005 the Ministry of Culture approved a 
new norm for Bokmal, whereas the Nynorsk reform proposal (2003) was rejected. 

The overarching language policy of most of the 20th Century was to merge 
the two written standards into a unified, amalgamated, pan-Norwegian standard. 
The different language reforms (1917, 1938, 1959) aimed at mutual convergence 
by introduction of Bokmal forms in Nynorsk and vice versa. The result of this 
policy was that both of the official standards obtained a great number of parallel 
forms (both in spelling and declination), and that a two level system of orthogra- 
phy within each of the written standards was established; a restricted and a wider 
norm. The restricted norm contained only so-called main forms and was obliga- 
tory in public administration and school textbooks. A main form need not be 
unique, however; two or more parallel main forms could exist. The wider norm, 
which contained so-called subsidiary forms in addition to the main forms, could 
be used by pupils, in newspapers, literature and in private writing. This policy of 
gradual, mutual approach by way of two-level norms prevailed and was the main 
goal for all language reforms until 1959. From that point on, the policy was grad- 
ually abandoned, but it was only officially rejected in 2002. During the 1980s and 
90s a number of dialect forms were also allowed in the Nynorsk standard 
(cf. Omdal & Vikor 1996: 114). The justification for this language planning prac- 
tice was that the Norwegian dialect diversity should be expressed by actual repre- 
sentation of different dialect forms in order to give people the opportunity to use 
a written language that was as close to their dialect as possible. As pointed out by 
Faarlund (2003: 313) optionality was now no longer a means towards an end, but 
an end in itself. 

The amalgamating efforts combined with the inclusion of dialect forms re- 
sulted in a bewildering amount of parallel forms. In some cases a word could be 
written in as many as eight different ways. The illustration from the website Bt.no 
presents an illuminating example. Here we can see Ivar Aasen (the founding 


1. The letter of approval from the Ministry of Culture: 
http://www.sprakrad.no/upload/spraknytt/Godkjenning %20av%20nytt%20rettskrivingssys- 
tem%20i%20nynorsk.pdf 
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father of Nynorsk) depicted in different disguises pronouncing the word “possi- 
ble” in its six possible variants (mogleg, [moglig], mogeleg, [mogelig], [muleg], 
[mulig]). The 2012-norm only allows the two main parallel forms: mogleg and 
mogeleg (the subsidiary, square bracketed, forms were left out since the affix -lig 
and the forms with -u- have been excluded from the norm, cf. Sprakradet 201 1a: 
55, 155). 


Illustration : Marvin, Bt. no, March 8th 2011 


Although the 2005 reform for Bokmal still allows a relatively wide range of free- 
dom of choice between parallel forms, the degree of variation has been 
substantially reduced, and the two level orthographic system has been aban- 
doned. Preservation or cancellation of the two level system was also a main ques- 
tion in the Nynorsk reform process and indeed the issue on which the fate of the 
2002 and 2003 proposals turned. The revised mandate of the Ministry in 2000 
clearly conveyed a wish to have the two level system repealed, and this wish was 
in fact heeded by the 2002 proposal. However, due to massive resistance in the 
public hearing, the proposal was revised on just this issue, and the Nynorsk re- 
form proposal, which was presented to the Ministry of Culture in 2003, opted to 
keep the system.” When a new Nynorsk reform committee was appointed in 
2009, however, a pivotal stipulation in the mandate was that the two level system 
should be abandoned. 


2. In fact, the story is even more complicated; the 2003 proposal, rejected in 2005, was a some- 
what modified version of a draft proposal circulated by the committee already in 2000, under 
Faarlund’s leadership, before the revised mandate from the Ministry was imposed. 
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4. The 2009 mandate 


In contrast with the 1996-2002 effort, (where after three years of work a new man- 
date was imposed and the committee restruck), the 2009 committee was presented 
with a detailed mandate, which gave thorough instructions to the committee re- 
garding both the reform process itself, and the content of the new norm. The in- 
structions were, however, as will be demonstrated, not always entirely clear or free 
from internal tensions. The very first condition that must be highlighted, however, 
is that the 2009 committee was to take the thorough and comprehensive work of 
the preceding Nynorsk reform committees (the 2003 proposal) as a basis for its 
work. This was of course a great advantage for the 2009 committee. This scholarly 
sound document represented an invaluable resource without which the new com- 
mittee could not possibly have completed its work within the relatively short time 
frame that had been set for the process. 

With regard to the content of the work, the most significant point in the man- 
date was, as already mentioned, that the two level system of orthography was to be 
abandoned: “ei tydeleg, enkel og stram norm for nynorsk, utan sideformer” “a clear, 
simple and restricted Nynorsk norm, without subsidiary forms” (Sprakradet 
201 1a: 27). In addition, the new norm should make it easier to be a user of Nynorsk 
regardless of education and linguistic competence, and it should be clear for 
everyone who uses Nynorsk as their secondary language.? However, the require- 
ment that the norm should make it “easier to be a user of Nynorsk” is not alto- 
gether straightforward. For instance, it would certainly be easier to use Nynorsk if 
attitudes towards this standard were more favourable. Obviously, however, to deal 
with such matters would go beyond the reform committee’s assignment. The un- 
derlying assumption of the requirement seems to be that it is difficult to use the 
Nynorsk norm due to the high number of parallel forms. Interestingly, language 
surveys show that approximately a third of those who have received their 
instruction in Nynorsk later switch to Bokmal (Grepstad 2010). The primary rea- 
son for this is no doubt the inferior position Nynorsk holds vis-a-vis the majority 
standard Bokmal. However, people themselves often explain the shift as a conse- 
quence of Nynorsk allegedly being too difficult to write correctly. People seem to 
feel insecure with regard to what the norm requires, and are afraid to make 
mistakes (Rodningen 1999). This insecurity may of course be linked to the 


3. Malet er 4 etablera ei norm som ger det lettare 4 vera nynorskbrukar, som er tydeleg for alle 
som bruker nynorsk som sidemal, og som er stabil over tid. Norma skal vera lett 4 bruka uavhen- 
gig av formell utdanning og sprakkompetanse. [The goal is to establish a norm which will make 
it easier to be a Nynorsk user, that is clear to everyone who uses Nynorsk as their secondary 
language and that is stable over time. The norm shall be easy to use regardless of formal educa- 
tion and linguistic competence.] (Sprakradet 201 1a: 27, my translation). 
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bewildering amount of variation in the Nynorsk norm. In fact, however, the same 
amount of variation is allowed within the Bokmal standard. So this permissive- 
ness alone cannot be the whole explanation. The significant difference is not the 
nature of the norms, but rather prevalence of use. The crucial point here is that in 
the case of Nynorsk, people make use of the allowed optionality to a relatively 
greater extent than in Bokmal. Hence, people may meet the same word written or 
inflected in different ways at every instance, since their encounters with Nynorsk 
are likely to happen much less frequently than their encounters with the majority 
standard Bokmal. This being the situation for the Nynorsk standard, it is likely 
that Nynorsk could benefit relatively more than Bokmal from a reduction of vari- 
ability allowed in the standard. As discussed by Faarlund (2003: 314), a reduction 
of orthographic choice would facilitate people's learning and present a firmer basis 
for leaning to write correctly. At the same time, however, more variation might in 
fact in some cases be perceived as easier than /ess variation, as for instance in the 
case of the reduction of the number of irregular forms by allowing parallel suf- 
fixes (see examples below). This means that the requirements clear, easy and simple 
do not always point in the same direction and might arguably be in actual conflict 
in specific cases. 

A second important requirement was that the new norm should be a national 
and not a regional norm; that is, dialect variants with a wide geographical distribu- 
tion should be preferred to more regionally restricted ones. Here, too, it is impor- 
tant to note the fact that Nynorsk is, and has always been, the less used of the two 
Norwegian standards. The proportions have varied somewhat, but today approxi- 
mately 13% of school children get their primary education in Nynorsk (cf. Grepstad 
2010). Most of these children reside in the western part of the country, as do most 
of the Nynorsk users in general. Hence, it has often been claimed that the Nynorsk 
standard is a regional, western variety of Norwegian with mainly western dialect 
forms (this claim is, however, contested and shown to by unfounded by Faarlund 
2003). A major concern for the Norwegian Language Council was, therefore, to 
prevent any further regionalization (actual or percieved) of the Nynorsk norm. The 
goal of the Nynorsk project in Norway has been and is still that it be a common 
standard language for the whole nation and not only for the Western part of the 
country. The requirement to ensure that it be a national norm may, however, come 
into conflict with the goal of reducing the number of variant forms, i-e., to create a 
more restricted norm with less variation, as reduction of variation may well out rule 
some relatively widespread (eastern) dialect variants (see discussion below). 

A third important point in the mandate was that forms dominating in the 
Nynorsk written tradition should be preferred over more marginal forms. Fre- 
quency in writing could also, however, as will be demonstrated below, well be in 
conflict with the demand to include dialect variants with a wide distribution. 
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Certain very widespread dialect forms may, as we will see, be rather marginal in 
the written tradition. 

Naturally, in accepting the appointment to the committee, the participants 
also accepted these requirements as overarching constraints on the task - the most 
important of these being the commitment to dismantle the two level system of 
orthography. However, as I have just argued, there were several unclear and poten- 
tially contradicting points in the mandate, which needed to be sorted out. In view 
of the potential conflicts between the different considerations that the committee 
was bound by its mandate to make, the committee had to make a list of priorities 
in order to carry out its assignment. 

The first point in the prioritized list was that the norm should have an inde- 
pendent foundation, that is, independent of Bokmal (cf. Haugen’s principle 
selection of norm). Hence, it should neither be an advantage nor a disadvantage 
for a given form that it be part of the Bokmal standard. This was in fact the first 
major Nynorsk language reform since 1917 that had this kind of autonomy as an 
explicit guideline and ideal. It is, however, a guideline that was not always easy or 
even possible to meet, given the current Norwegian linguistic climate and given 
Nynorsk’s position as a minority language. Bokmal is in all respects the dominat- 
ing written norm in Norway. Most textbooks, for example, are written in Bokmal 
and subsequently translated into Nynorsk. Allowing parallel forms that are dom- 
inant in the Bokmal norm might well contribute to the eventual extinction of the 
traditional Nynorsk form, since the most common practice among translators is 
to translate only what they are obliged to translate. This problem actually applied 
to a whole range of the parallel forms and represented a real headache for the 
committee. 

The second point on the priority list was that the reform process should focus 
on the Nynorsk written tradition and practice as it has evolved over the centuries 
and place particular emphasis on contemporary use of Nynorsk. The committee 
had at its disposal a balanced corpus of 87 million words from written Nynorsk 
texts that covers a wide range of different genres and includes texts from the end of 
the 19th Century up until today (Det nynorske tekstkorpuset ved Universitetet i Oslo 
[The Nynorsk text corpus at the University of Oslo]). The main body of the corpus, 
however, (approximately 80%) consists of contemporary texts from the 1980s and 
forward. This corpus, which is searchable trough an Internet based interface, is a 
quite recent tool that was not available to the previous reform committees. Hence, 
the 2009 reform committee could establish empirically the actual frequency of use 
of different forms and consequently had a firmer basis on which to make their 
decisions. The committee’s goal was to approximate what has been labeled a 
“centerline” or “prototypical” Nynorsk, that is, the combination of variants that the 
majority of Nynorsk users actually apply in their writing (cf. Vikor 2009, 2010). 
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Thirdly, geographically widespread dialect features should be attributed weight, 
while, fourthly, lesser-used parallel main or subsidiary forms should, as a main rule, 
be excluded from the norm, whereas main or subsidiary forms with a relatively 
high standing (either in writing or in the dialects) should be retained. All these re- 
quirements were of course part of the mandate, but the point here was to make a 
prioritized list that could be helpful in cases where different requirements conflict. 

An example of how several requirements came into conflict is the case of the 
very frequent word form berre [bare] ‘only. In the Nynorsk norm previous to the 
2012 norm this word form had one main form berre and one parallel subsidiary 
form [bare]. The main form berre clearly dominates in the Nynorsk written tradi- 
tion and in contemporary Nynorsk, but the subsidiary form [bare] is very wide- 
spread in the Norwegian dialects. It is used in all of the main Norwegian dialect 
areas and it clearly dominates in the northern part of the country. For many 
Nynorsk users that have this form in their dialect, [bare] seems to have acquired 
an important symbolic status and may function as a signal of a certain kind of 
Nynorsk identity. These are of course good arguments for keeping [bare] as a par- 
allel form even if berre dominates in writing. However, [bare] is the sole main form 
in Bokmal and even if the committee was supposed to pay attention only to 
Nynorsk and not be concerned with whether a form was part of the Bokmal norm, 
this fact nevertheless did have an impact on the committee's final decision. The 
committee could not ignore the possibility that [bare] in the long run might dis- 
place berre, given the dominant position of Bokmal (cf. the elaboration of this 
point above). After long discussions the committee finally decided only to keep 
the main form berre, invoking the priority of written tradition over oral use. 

As discussed above the committee also had to prioritize between “clear and 
simple” on the one hand and “restricted” on the other. Simple and restricted do not 
always coincide, and in cases of conflict simple was chosen before restricted. This 
was, however, a point of much discussion in the committee. As already mentioned 
it has been claimed that a reduction of orthographic choice would facilitate peo- 
ple’s learning and present a firmer basis for leaning to write correctly (cf. Faarlund 
2003: 314). Though this may be generally true, there are cases where the opposite 
might hold. One example of this may be found in noun inflexional system. Most 
feminine nouns have the plural indefinite and definite suffixes -er/-ene like in (1). 
However, certain single syllable feminine nouns have -ar/-ane as main forms and 
[-er-ene] as subordinate forms like in (2) and (3). 


(1) bygd - bygd-er - bygd-ene 
village — village-PL.INDEF. - village-PL.DEF. 


(2) elv - elv-ar - elv-ane 
river — river-PL.INDEF. — river-PL.DEF. 
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(3) elv - [elv-er] - [elv-ene] 
river — river-PL.INDEF. — river-PL.DEF. 


The question was whether the norm would become simpler and easier to learn if 
the subordinate forms were omitted, or, alternatively, if they were paralleled with 
the main suffixes. In this case the committee thought that giving Nynorsk users the 
opportunity also to apply the main paradigm in these single syllable nouns, that is 
-er/-ene, would be the easiest since it represents a simplification of the system.* 
There is also a clear tendency that the -er/-ene suffixes are gaining ground in the 
dialects. Hence, both suffixes were included in the new norm, and “simple’, inter- 
preted as possible reduction of irregularity, was chosen before “restricted”, under- 
stood as a reduction of possible choice. 

In most cases, however, reduction of norm-internal variability was chosen. 
Weak feminine nouns, for instance, had the plural suffixes [-or/-one] as subordi- 
nate alternatives to the main forms -er/-ene and the singular indefinite suffix [-a] 
as subordinate alternative to -e (cf. examples 4 and 5): 


(4) vis-e — vis-er — vis-ene 

song-SG.INDEF. — SOng-PL.INDEF. — SOng-PL.DEF. 
(5) [vis-a — vis-or — vis-one] 

song-SG.INDEF. — SOng-PL.INDEF. — SONg-PL.DEF. 


In this case the committee considered an omission of the subordinate form to be 
the “easiest” solution. This decision was supported both by the frequency distribu- 
tion of the main and subordinate variants and by its distribution in the dialects. 
According to the Nynorsk corpus the subordinate form is very rarely used in writ- 
ing and the plural suffixes [-or/-one] are losing ground in the areas that tradition- 
ally used to have these forms (cf. Skjekkeland 2005). 

Even if the mandate and the list of priorities may seem straightforward 
enough, it proved in many cases to be quite difficult to implement. The reform 
committee's work may well be described as a balancing act between the different 
considerations embedded in the mandate, the different and often diverging views 
of the members of the committee, and not least diverging responses from the 
public. 


4. ‘There are, however, some feminine nouns that have -ar/-ane as the only possible plural in- 
definite/definite suffix. This applies to all feminine nouns with the suffixes -ing/-ning: 

dronning - droning-ar - droning-ane 

queen - queen-PL.INDEF - queen-PL.DEF 


These form have a strong position in the dialects and are also a productive group of nouns. 
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5. The committee and the process — “the voice from below” 


Including the so-called “voice from below” in all stages of the reform process was 
a major concern for the policy makers when the new Nynorsk reform process was 
initiated in 2009. In practical language planning and policy-making this means 
allowing “the views and priorities of language users, rather than top-down politi- 
cal will, to dictate the direction of language policy making” (Linn 2010: 115). As 
pointed out by Linn (2010), most of the critical language planning literature tends 
to maintain the so called “bird’s eye view” and does not apply the “worm’s eye 
view’. This means that both theory development within the field as well as the ex- 
amination of ideological debates are principally conducted from the point of view 
of the primary actors and policy makers and not from that of primary users. This 
has been the tendency not only in critical language planning, but also in practical 
language planning at least as it has been conducted through Norwegian language 
reforms and policy-making in the 20th Century. Only with the latest Nynorsk lan- 
guage reform did the rhetoric change dramatically as major emphasis was put on 
including the “worm’s eye view” (but see Jahr 2007).° It was an explicit goal that 
both the composition of the committee and the completion of the reform process 
should ensure broad societal foundation. The Norwegian Language Council stated 
the following about its strategic considerations concerning the appointment of the 
committee: 


Nemnda er sett saman med sikte pa at ho skal ha hog strategisk normering- 
skompetanse pd nasjonalt nivd, sentral sprakvitskapleg kompetanse om moderne 
sprakbruk, historisk og praktisk roynsle fra malrorsla og deltaking fra nokre av dei 
viktigaste brukargruppene. (Sprakradet 201 1a: 28). 

The committee is composed with the objective that it have high strategic norm 
reform competence at a national level, key linguistic competence about modern 
language use, historic and practical experience from the Nynorsk language move- 
ment, and participation from some of the most important groups of Nynorsk 
users. (Sprakradet 2011a: 28, my translation). 


The 2009 committee had two journalists, two teachers, two linguists and one pub- 
lic servant amongst its members.° Several of these members had thorough knowl- 
edge about the Nynorsk grass roots, practical experience from different Nynorsk 


5. Although some scholars have called the planning of modern Norwegian “a sociolinguistic 
experiment - ‘from below” (see Jahr 2007). 


6. ‘These people were appointed: Senior advisor Grete Riise (leader), Associate Professor Unn 
Royneland (deputy leader), journalist Ragnhild Bjorge, teacher and writer Tore Elias Hoel, 
teacher Karin Magnetun, journalist and editor Svend Arne Vee and director of Norsk Ordbok 
2014 Ase Wetas. 
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language interest organizations (e.g. Noregs Mallag, Norsk Madlungdom, 
Mediemallaget), inside knowledge of the difficulties Norwegian elementary school 
children have in learning Nynorsk, and good insight into the resistance Nynorsk 
encounters in Norwegian society at large. The newly appointed language com- 
mittee was, however, met with substantial criticism from different angles - among 
other things for its lack of scientific linguistic competence as only two of the com- 
mittee members were linguists (cf. Faarlund 2010, Lunden 2010). Replying to 
some of these reactions, a member of the previous language committee, Oddrun 
Gronvik, stated that “weak connection to society at large” was an important ex- 
planation for the 2003 proposal’s lack of success: 


A gjera endringar i ei etablert skriftnorm er inga lett sak, og handlar ikkje berre om 
ortografiske prinsipp og reglar. Eg sat i den forre rettskrivingsnemnda (2001-2003), 
som var ei nemnd med sterk fagkompetanse, men rekruttert fra eit smalt miljo av 
vitskapsfolk og leksikografar. Det enda med at nemnda kloyvde seg og bae framlegg 
vart avviste. Det var mange grunnar til det, men ein av dei var heilt opplagd veik 
samfunnskontakt. Korkje prosess eller resultat vart gode nok. (Gronvik 2010). 

To make changes in an established written standard is no simple matter, and is 
not only about orthographic principles and rules. I was part of the previous or- 
thographic revision committee (2001-2003), which was a committee with strong 
scientific competence, but recruited from a narrow environment of scholars and 
lexicographers. The final result was the committee was divided and both propos- 
als were rejected. There were many reasons for this, but one, without a doubt, was 
the committee’s weak connection to society at large. Neither the process nor the 
result was good enough. (Gronvik 2011, my translation). 


As Gronvik points out, to make changes in a norm is not only a matter of applying 
orthographic rules and principles. Proposed norm revisions have to be implement- 
ed by some kind of social institution and they need to be accepted by the commu- 
nity (cf. Haugen’s 1966 principles of language planning); hence, a reform proposal 
needs to be perceived as legitimate by the community. In order to ensure such le- 
gitimacy it was, according to the Norwegian language policy makers, important 
that the new reform committee should include stakeholders from many different 
parts of society and that the reform process should be open and inclusive. 
Legitimacy is no doubt a vital requirement for a reform committee. However, 
one might disagree on how such legitimacy is achieved. Scientific competence is 
obviously one essential factor (as pointed out by Faarlund 2010), but at this par- 
ticular historical point it was probably not enough. Other types of competence 
were seen as equally important. The previous Nynorsk revision committee, which 
only had linguists amongst its members, did not manage to reach consensus and 
their proposal was not accepted, neither by the public at large, nor by the political 
decision makers. Naturally, its particular composition may not be the reason why 
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the committee failed to reach an agreement - although it lies at the heart of aca- 
demic endeavor to search out differences and points of divergence rather than to 
seek consensus - nor why its split submission was not accepted. More important, 
probably, is the point brought to light by Gronvik, that a reform needs broad soci- 
etal foundation and support in order to succeed. A quite recent study from 
Zimbabwe suggests that practical language planning tends to be more successful 
when approached bottom-up, whereas top-down approaches generally fail 
(Makoni, Makoni & Nyika 2008). Arguably, the two most recent Nynorsk lan- 
guage reforms can be seen to point in the same direction. 

There is no doubt, however, that the new Nynorsk reform committee never 
would have succeeded in arriving at a reform proposal within the established time 
frame were it not for the comprehensive and detailed work laid down by the previ- 
ous language expert committees. Indeed, the 2003 proposal was an essential pre- 
requisite and working tool for the new committee.’ In effect the new committee 
was able to draw on a broader scientific competence than that which was included 
in the committee itself. The comprehensive work already done by outstanding 
scholars (Faarlund prominent among them), allowed policy makers in this in- 
stance to put greater emphasis on other aspects of legitimacy and on including 
“the voice from below’, both with regard to the composition of the committee and 
with regard to the reform process as such. At this point, this was probably a wise 
decision, given that general acceptance from the public is fundamental for the suc- 
cess of a new norm. 

The Chair of the language Council, Ottar Grepstad, called the 2011 proposal 
an “historic compromise” and claimed that no former Norwegian language reform 
has had broader political legitimacy or a better foundation (cf. Sprakradet 2011b 
& c). Different from all other Norwegian language reform proposals during the 
20th and 21st Century, and in sharp contrast to the way in which most language 
revision processes are set out, the Norwegian Language Council agreed on a man- 
date which was meant to ensure an open and inclusive process all the way through. 
The reform committee, which started its work in January 2010, carried out this 
requirement by arranging a number of meetings with institutions, companies and 
organization, by establishing a web-site and a discussion forum, by continuously 
publishing on the web all case documents and reports from the meetings, and by 
presenting a discussion document and arrange a public hearing.® In total the 


7. The committee was appointed in December 2009 and started its work January 2010. The final 
deadline for the proposal was April 1st 2011. Hence the committee had one year and three months 
to conclude its work. Previous reform committees have had much more flexible time frames. 

8. The discussion web site can be accessed at the following URL: http://www.sprakrad.no/nb- 
NO/Toppmeny/Aktuelt/nynorsknorm/. 5 letters from individual persons were sent to the 
Language Council after the reform committee had concluded its work. 
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reform committee arranged 38 meetings with different institutions all of which 
provided important input to the committee's work. These were meetings with rep- 
resentatives from publishers, translating firms, Nynorsk interest organizations, 
newspapers, teacher organizations, and universities. In addition to these meetings 
the committee also received many comments and suggestions electronically. A to- 
tal of 502 contributions were posted either on the discussion web-forum or sent to 
the committees e-mail accounts in the course of the norm revision process. These 
contributions were introduced into the committee’s discussions and contributed 
importantly to their substance. Since all case documents and subsequent reports 
were published immediately before and after the meetings, people were able to fol- 
low the committee’s work very closely and could send reactions and feedback, as 
the process was taking place. Hence the interested public had open access to the 
decision-making process. In previous language reform processes the public only 
had the opportunity to react to a more or less completed norm revision proposal. 
By contrast, the preliminary proposal of the 2009 committee, published in the 
beginning of February 2011, had already been subject to a great many responses 
and suggestions before it was submitted to the public hearing. The hearing, which 
was held February 21st 2011, had approximately 380 participants (either physically 
present or taking part by web-television) and the committee received 78 consulta- 
tive submissions (which together amount to a document of 232 pages). In this last 
part of the process, when people had the opportunity to see the discussion docu- 
ment as a whole, the committee received many and often diverging views and re- 
actions to its work. 

A significant example of a proposed change that received particularly many 
(mostly negative) responses both before and during the consultative round is the 
case of the infinitive system. The two-level Nynorsk standard allowed three differ- 
ent infinitival endings: two parallel main forms -a or -e (kasta, vera or kaste, vere 
‘throw, ‘be’) and a subsidiary system where the infinitival ending would shift be- 
tween [-a] and [-e] in accordance with the principle of the so-called syllable bal- 
ance rule (kaste vs. vera). According to this rule, words that had short root syllable 
in Old Norwegian retained the vowel in the final syllable, whereas words with a 
long root syllable got a reduced word final vowel (schwa). The reflexes of the syl- 
lable balance rule may be seen in infinitives and in weak masculine and feminine 
nouns in the eastern Norwegian dialects, but in the Nynorsk standard the rule is 
only reflected in infinitive endings. The two main parallel Nynorsk forms are vari- 
ants of western Norwegian dialects where the principle of vowel balance did not 
apply. Western and northern dialects have -e (schwa), -a, or no ending at all in all 
infinitives. Traditionally the different historical effects of the syllable balance rule 
have constituted the most decisive variable employed by Norwegian dialectolo- 
gists when dividing Norway into different dialect areas. This variable divides 
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eastern Norway from western and northern Norway, and it also divides eastern 
Norway internally according to how the syllable balance rule is implemented 
(for details and maps see Skjekkeland 2005, Mehlum & Royneland 2012). This 
system is, however, currently in decline in eastern Norway where the use of infini- 
tives with schwa is expanding (e.g. Royneland 2009, Skjekkeland 2005). The in- 
finitive ending with -e is by far the most frequent in modern Nynorsk written texts 
(and is the only permitted variant in Bokmal), whereas the subsidiary divided 
form [-a]/[-e] has almost fallen out of use (cf. Sprakradet 2011a: 172). The 2009 
revision committee therefore proposed to reduce the number of possible infinitive 
endings by keeping only the two main parallel forms. This proposal, however, pro- 
voked quite a strong reaction — even a facebook group in support of the divided 
infinitive endings appeared. The committee had apparently touched upon a nerve. 
Most of the arguments in favor of maintaining the divide had to do with issues of 
identity; removal of the divided form would deprive eastern Norwegian Nynorsk 
users of a key device for expressing their eastern identity in writing (see discussion 
in Section 4). Since the reactions were so clearly in favor of keeping the divided 
system, the committee reversed its initial stance on this matter. 

The goal of this open process was to bring into play the “voice from below” in 
a manner that would help the committee to balance different views and ensure 
that the process would be as democratic as possible. And we have seen that the 
feedback channels built into the process in fact came to have substantive influence 
on choices made by the committee. Still, one may ask whether the mechanisms I 
have described could ensure a sufficiently high degree of unconstrained participa- 
tion, and thus whether the process was genuinely democratic. 


6. Responses to the proposal and the process 


It should come as no surprise that the question of democratic participation has 
received very different answers from different quarters. Thus, in a letter sent from 
the Norwegian Language Council after the Executive Board had unanimously ac- 
cepted the reform proposal, the process was described as follows: 


Arbeidet med 2012-normalen kan gjerne bli eit Icrestykke i moderne 
samfunnsendring om korleis prosessar av dette slaget tener til a finne fram til kloke 
avgjerder i vanskelege sporsmdl der meiningane er sveert delte. (Sprakradet 2011c). 
The work with the 2012 norm may well stand as a model for current practices of 
social change, illustrating how processes of this kind may lead to wise decisions on 
difficult issues where opinions diverge widely. (Sprakradet 2011c, my translation). 
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However, not all stakeholders agree with the Language Council’s assessment. In a 
letter sent to the National Language Council, after the conclusion of the Nynorsk 
reform process, Associate Professor of Norwegian and supporter of the inclusion 
of Eastern Norwegian forms in Nynorsk, Tordis Irene Fosse, criticizes the reform 
process and questions its democratic status in light of the final result: 


Den demokratiske prosessen i rettskrivingssaka blir framheva, ikkje minst av leiaren 
i Sprakradet. Men er det nok a fa koma med synspunkt, dersom synspunkta ikkje 
blir tatt omsyn til? Mange av horingssvara var negative til fierninga av austnorske 
former, med omtrent den rekka med eksempel som eg har med i tabellen. Nemnda 
har justert sveert lite i den endelege innstillinga. (Fosse 2011). 

The democratic process of the language reform proposal is highlighted, not least 
by the chairman of the Language Council. But is it enough to be allowed to pres- 
ent your opinions, if these opinions are not taken into account? Many of the con- 
sultative submissions were negative towards the removal of eastern Norwegian 
forms, with approximately the range of examples I have included in the table. The 
reform committee has only to a very small extent made adjustments in its final 
proposal. (Fosse 2011, my translation). 


In her letter, Fosse seems to argue that unless the opinions and suggestions put 
forward in the consultative round have direct impact on the final result, there is no 
evidence that they have been taken into account, and thus the democratic author- 
ity claimed for the process is dubious. The decisive issue here is how we may rea- 
sonably understand what it is to “take into account” submissions and responses. 
Should this be restricted to what can be shown to have had direct impact on the 
final proposal? If we answer in the affirmative, then it may seem that a great many 
of the different opinions that were presented through the consultative round were 
of no real significance, and that the process, accordingly, was democratic in ap- 
pearance only. This is a difficult and challenging criticism to meet. However, we 
may note that it is a very common objection to non-consensus conclusions and 
decisions of this type. In a consultative round there will obviously be many differ- 
ent and often opposing views. In a democratic process a crucial task is precisely to 
try to balance these different opinions. Clearly, this will not be a matter of sheer 
quantitative weighing of responses. As John Stuart Mill aptly frames the challenge 
in his Considerations of Representative Government (1861), “One of the two, as the 
wiser or better man, has a claim to superior weight: the difficulty is in ascertaining 
which of the two it is” (here after Estlund 2008: 1). The result of this balancing will 
typically be that some stakeholders experience that their voice was not heard, that 
their particular views were not given due consideration, and thus did not have the 
intended impact on the final outcome. This perception will often, and quite natu- 
rally, lead stakeholders to view the process as illegitimate. The question is, how are 
we to interpret and evaluate such assertions? Without pursuing a discussion of the 
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principles of democratic process, we can grant that the ideal in a democratic pro- 
cess must be that legitimate stakeholders will acknowledge its integrity and thus 
accept the result as legitimate, even if the final conclusion is not fully in accor- 
dance with their own preferences. Thus, accusations of the sort that Fosse makes 
are on the face of it worrying. Surely, however, it cannot be a general requirement 
on policy-making procedure that no such negative reactions are produced. Their 
merit and their significance, it seems to me, will depend on how widespread they 
are, and, in particular, on whether or not such challenges are made from a variety 
of incompatible positions on the matter to be resolved. Thus, if the challenges to 
democratic integrity arise from relatively few quarters, and from positions with 
quite different substantive positions, then the claim to genuine openness and to 
democratic authority may still be sustained (cf. Estlund 2008). 

‘The representatives of the Nynorsk reform committee were charged with the 
responsibility of acting in the Nynorsk people's interest, but not as their proxy rep- 
resentatives nor always in accord with their explicit preferences (as if this were even 
possible), but with enough authority to exercise swift and resolute decisions based 
on the given mandate. A process cannot be dismissed as non-democratic only be- 
cause some stakeholders failed to get their expressed wishes reflected in the final 
outcome. The decisive question, then, must be whether the committee and its work 
is accepted as legitimate by different participants, with different views and prefer- 
ences, in spite of the fact that their preferred variant was not chosen. In fact, the vast 
majority of the submissions that were presented in the consultative round acknowl- 
edged the mandate, the reform committee and the reform proposal as legitimate 
and the process as democratic even if they did not agree with the committee at all 
levels or in every detail. Many organizations, institutions and interest groups, stated 
that they would act in solidarity with the proposal in spite of minor disagreements 
(cf. the consultative submission). In a letter to the Ministry of Culture dated May 
26th 2011, the board of the major Nynorsk interest group, Noregs Mallag, states: 


Styret i Noregs Mallag meiner at rettskrivingsnemnda alt i alt har gjort eit godt 
arbeid, og at nemnda sitt framlegg som heilskap er eit godt framlegg i trad med nem- 
nda sitt mandat. [...] Denne normeringsprosessen har vore konstruktiv og rolegare 
enn tidlegare normeringsprosessar. Styret i Noregs Mallag meiner noko av grunnen 
er at rettskrivingsnemnda og Sprakradet har lagt godt til rette for innsyn i og innspel 
til arbeidet i nemnda. (Noregs Mallag 2011). 

The Board of Noregs Mallag consider that the reform committee all in all has done 
a good job, and that the committee's proposal as a whole is a good proposal in 
line with the committee’s mandate. [...] This language planning process has been 
constructive and calmer than previous language planning processes. The Board of 
Noregs Mallag thinks that part of the reason is that the reform committee and the 
Norwegian Language Council have prepared the ground well for insight into and 
contributions to the committee's work. (Noregs Mallag 2011, my translation). 
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Ina similar vein, in its consultative submission on the discussion document, repre- 
sentatives from the University of Bergen (the major University of western Norway 
and a significant stakeholder with regard to the development of Nynorsk) writes: 


Nemnda fortener stor ros for a ha gjennomfort ein open prosess. Og sjolv om 
vi ikkje stor alle detaljar i framlegget fra nemnda, held vi med nemnda i det 
meste. (Universitetet i Bergen, 24. februar 2011). 

The reform committee deserves credit for having carried out an open process. Even 
if we do not support the proposal in every detail, we mostly agree with the commit- 
tee on most matters. (University of Bergen, February 24th, 2011, my translation). 


The University of Bergen open their commentary by addressing the mandate and 
state that not all constituents of their academic community agreed on the prem- 
ises laid down in the mandate - particularly with regard with the abolition of the 
two-level system of orthography. However, they leave criticism concerning the 
mandate, accepting its terms as they continue with a detailed review of the dis- 
cussion document. Although also other stakeholders were critical of the mandate 
and the language policy objectives behind it, most stakeholders seemed to agree 
that a Nynorsk language reform was necessary and a reduction of forms desirable 
(only not which forms). As explicitly stated in the submission from the University 
of Bergen, the type of critique that is most relevant to a discussion document is 
critique that takes the mandate as its point of departure. This is not to say that 
critique of the mandate itself is not legitimate per se, only that it must be kept 
distinct from the assessment of the legitimacy of the process in so far as it is based 
on the mandated premises. This would simply be another discussion. Hence the 
critique that has been considered here concerns primarily the alleged democratic 
nature of the reform process and legitimacy of the committee within the man- 
dated terms. 

The discussion document was, however, also criticized for not conforming to 
its mandate in all cases. This is of course a most relevant and important criticism 
that bears on the committee’s interpretation of the mandate and with the balancing 
of the different considerations in the mandate. Some stakeholders claimed that too 
much emphasis was put on written practice at the expense of dialect usage, where- 
as other stakeholders expressed the view that too much emphasis was put on 
current dialect distribution at the cost of the written tradition. This mirrors the 
opposing ideological views and stances that are present within the Nynorsk com- 
munity (i.e. the traditionalist view versus the convergence towards Bokmal view on 
the one hand, and the freedom of choice (or pro-dialect) view versus the restrictive 
norm view on the other (cf. Vikor 2010)). In their more pronounced versions these 
opposing stances were difficult, not to say impossible, to accommodate. 
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As well, some reasonably reacted to the fact that the committee suggested in- 
troducing two new forms in the Nynorsk orthography; 2nd person plural pronoun 
dykk - dykk in addition to the existing de - dykk. The background for this sugges- 
tion was the fact that many Nynorsk users have difficulties in using these forms - 
first, they do not have the distinction between a subject and an object form in their 
dialect (this is increasingly the case as more and more dialects are losing the 
distinction) and, secondly, they find the form dykk quite alien. Actually very few 
dialect speakers have this particular form in their dialect. The most widespread 
form is some variant of dokk(e/er). The reason why this form was selected by Ivar 
Aasen, was probably that great variation in the dialects lead him to choose the form 
that corresponded best with the Old Norse form (bykkr). Given that many Nynorsk 
users feel this form to be distant and unnatural, the committee's suggestion to in- 
troduce dykk also as a subject form (to avoid problems for those who do not have 
subject/object distinction in their dialect), was an unhappy one. Almost all the con- 
sultative submissions were negative on this point — although not necessarily nega- 
tive to the idea of introducing an alternative form. Hence, in the final proposal the 
forms dokker-dokker were suggested instead. This was a proposal that first came up 
on the committee’s web discussion forum and also in e-mails to the committee dur- 
ing the reform process, and again in the consultative round. Here, then, we have 
another clear instance where the “voice from below” determined the outcome. Al- 
though this proposal probably accommodated the view of many stakeholders, oth- 
ers were critical, arguing that the committee in this instance failed to realize its 
mandate - that is, to reduce the number of alternative forms. In this particular case, 
however, the committee, influenced by feedback, reasoned that simplicity (also a 
consideration emphasized in the mandate) should be given greater weight than 
restrictiveness. Of course this is a debatable stance. The example well illustrates the 
sort of case where arguments in opposing directions are readily available within the 
terms of the mandate. It will, therefore, be very interesting to see whether the new 
forms will be adopted and used by the Nynorsk community. Ultimately, it must be 
the effect on practice that will determine their usefulness and success. 

It seems fair to say that most stakeholders agreed that a Nynorsk language 
reform was necessary, and also that a reduction of forms was desirable (though, 
of course, disagreeing on which forms should go).? The fiercest criticism was put 


9. Nynorsk kultursentrum, an association of 19 administrative regions, counties, educational 
and cultural insitutions and organizations says for instance: "At det i ei sak med sd mange og 
ulike synspunkt blir lagt fram ei samroystes instilling, vitnar om solid arbeid og vilje til nyan- 
serte vurderingar, og gir innstillinga den tyngda den fortener. I enkeltsporsmal er meiningane 
delte innanfor var institusjon som dei nok er i mange andre miljo. Di viktigare er det at eit breitt 
nynorskmilj@ no kan samle seg om det som er viktigast. (Nynorsk kultursentrum, February 
2011) [That an issue with so many and such diverging points of view is presented in a unanimous 
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forward by relatively small but ideologically strongly committed groups occupy- 
ing opposing poles of the language ideological spectra (as explained above). Of 
those who accepted the politically determined mandate and the general level of 
language policy, most major stakeholders are in favor both of the process and the 
result. 


Dei sentrale nynorskinstitusjonane som har uttalt seg, har da ogsd stotta hovudtrek- 
ka i framlegget, og ingen har gatt prinsipielt imot. Til skilnad fra den forre debatten 
om nynorskrettskrivinga 2001-2003 har eit breitt nynorskmiljo no samla seg om 
den nye rettskrivinga. (Sprakradet 2011c). 

The main Nynorsk institutions that have presented their views have supported 
the key features and none has opposed at the level of principle. In contrast to 
the previous debate about Nynorsk orthography 2001-2003 a broad coalition of 
Nynorsk users has come together in support of the new norm (Sprakradet 2011c, 
my translation). 


However, while a fairly strong case can be made for the democratic nature of the 
process and its outcome, recent critical discussion in Norwegian media highlight 
a further, related issue. The 2012 norm has recently been a subject of debate in a 
series of newspaper articles and letters to the editor in several Norwegian newspa- 
pers (in particular Aftenposten; January 22nd, February Ist & 6th 2012). The new 
norm has been accused of falling short with respect to the mandated requirements 
of simplicity of use and reduction of forms. The claim is that the new norm is 
messy and chaotic, and that it actually allows more variation than before. A prime 
example is the sentence, ”Vi onskjer dykk velkomne” “We wish you welcome,’ 
which in theory may be written in 16 different ways since it now is possible to 
choose between dykk/dokker (you), onskjer/onsker/ynskjer/ynsker (wish) and vi/me 
(we)!°. In practice, however, stylistic restrictions would prohibit many of the 


proposal, bears witness to thorough work and a willingness to make nuanced assessments, and 
gives the proposal the weight it deserves. Within our institution opinions are divided when it 
comes to individual cases, like they probably are in many other institutions and organizations. 
This makes it all the more important that a broad Nynorsk society gather around what matters 
most] (Nynorsk kultursentrum, February 2011) (my translation). 


10. ‘The four first alternatives were possible before the reform and the twelve next have been 
added after the norm revision. The first alternative is the most conservative variant whereas the 
last is the most radical. I would argue that the combinations in example 5-14 are not very plau- 
sible since they combine quite conservative with rather radical forms. People are more likely to 
choose a combination of either conservative forms or more radical forms. 

1. Me ynskjer dykk velkomne 
Me onskjer dykk velkomne 
Vi ynskjer dykk velkomne 
Vi onskjer dykk velkomne 
Me ynskjer dokker velkomne? 
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theoretically possible combinations. Nevertheless, the general claim, of which this 
sentence is intended as dramatic illustration, is correct; at some points the new 
norm will indeed allow more freedom of choice. This is, first and foremost, an in- 
evitable result of the fact that the two-level system, in accordance with the com- 
mittee’s mandate, has been repealed. The committee has accordingly reduced the 
wide norm significantly. To produce a more restrictive narrow norm, however, 
would have entailed a dramatic curtailment of existing practices, and would have 
had no chance of realizing a balance of concerns generally acceptable to most 
stakeholders. Hence the 2012 norm allows Jess variation than the former wide 
norm (which included both main and subsidiary forms), but more variation than 
the former narrow norm. Some stakeholders obviously would have wanted a re- 
duction of the narrow norm. However, this was not the mandate of the committee, 
and would in any case not have been a possible task, given the current language 
policy climate. So while criticism of particular decisions (such as the introduction 
of dokker-dokker, as discussed above) may turn out to be warranted, the inference 
that the committee has subverted its mandate seems unfounded. Although this 
kind of criticism has frequently been framed as process internal — that is to say, as 
based on the premises of the mandate and its explicit constraints - it seems rea- 
sonable to read it rather as an implicit objection to the mandate itself. Here the 
committee's work is being placed in a larger political context, and used for larger 
language political purposes. This is, needless to say, quite legitimate and to be ex- 
pected in a working democracy. It does not entail, though, that the committee 
ought to be moved by this in its work, or in its assessment of the outcome. My 
concern here is restricted to the legitimacy of the actual process and the ensuing 
proposal, and this entails a limitation on the kind of criticism that is directly rele- 
vant. My argument has been that the legitimacy of the committee and its proposal 
must be measured in this perspective. 


6. Me ynsker dykk velkomne? 
7. Me ynsker dokker velkomne? 
8. Me onskjer dokker velkomne? 
9. Me onsker dykk velkomne? 
10. Me onsker dokker velkomne? 
11. Vi ynskjer dokker velkomne? 
12. Vi ynsker dykk velkomne? 
13. Vi ynsker dokker velkomne? 
14. Vi onskjer dokker velkomne? 
15. Vi onsker dykk velkomne 
16. Vi onsker dokker velkomne 
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7. Conclusion 


Essential to the success of the 2011 proposal was a pragmatic attitude on the part 
of all actors involved in the process (policy-makers, the Language Council, the 
reform committee and the public at large) and a strong will to reach agreement in 
spite of internal disagreement. When the reform proposal was accepted by the 
Ministry of Culture in September 2011, the chairman of the Norwegian Language 
Council called it “Ein liten flagg-dag for viljen til a bli samde” “A small day of vic- 
tory for the determination to reach consensus’, and continued: 


2012-normalen er den storste rettskrivingsreforma for nynorsk pa femti dr. Inga 
tidlegare rettskriving for nynorsk har vore sa grundig diskutert av sa mange og fatt 
sa brei tilslutning. Eg sender ein stor takk til alle dei som strekte seg langt for a skape 
ei rettskriving for nynorsk som femner heile landet og sveert ulike grupper av sprak- 
brukarar, seier styreleiar Ottar Grepstad i Sprakradet. 

The 2012 norm is the most comprehensive orthographic reform for Nynorsk in 
fifty years. No previous Nynorsk reform has been so thoroughly discussed by so 
many and had such wide support. I send a big thank you to all those who went to 
great lengths in order to create a new orthography for Nynorsk that embrace the 
whole country and very different groups of language users, says chairman of the 
Language Council, Ottar Grepstad. 
http://www.sprakrad.no/nb-NO/Toppmeny/Aktuelt/-Ein-liten-flaggdag-for- 
viljen-til-a-bli-samde/ (accessed December 2011). 


As we have seen in the above discussion, most stakeholders did seem to accept the 
committee's work and acknowledge both the committee and the process as demo- 
cratic and legitimate - even some who questioned the mandate that the committee 
was given. A discussion of the premises laid down in the mandate has not been the 
focus of this paper. Rather, my question has been whether the committee fulfilled 
the mandate in an adequate way. I hope to have given some reason to think that 
the process indeed was “open and inclusive,’ in the sense that the “voice from be- 
low” was successfully included and in fact had a real impact on the very content of 
the outcome.! 


1. I would like to express my gratitude to Lars Vikor, Ase Wetas and Grete Riise for very help- 
ful comments on a draft of this paper. Thanks also to my students at University of Chicago 
Spring Quarter course 2012, Language Policy and Planning, for interesting and valuable discus- 
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discussions and important contributions throughout the writing process. 
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Gender maintenance and loss in Totenmalet, 
English, and other major Germanic varieties 


Peter Trudgill 
University of Agder 


I Bergen pa hansatida var det neer kontakt mellom to sveert like sprak, gammalnorsk 
og mellomlagtysk. Begge desse spraka hadde tre genus ... i substantiv. Resultatet av 
denne kontakten var eit sprak, eller snarare ein dialekt, med to genus (Jan Terje 
Faarlund 2004)! 


Indo-European is one of a minority of language families in the world that are char- 
acterised by the presence of the phenomenon of grammatical gender.” The earliest 
known form of Proto-Indo-European had only two grammatical genders: Fortson 
(2010: 114) tells us that “the oldest preserved branch of Indo-European, Anato- 
lian, had only a two-way distinction between animate or common gender and in- 
animate or neuter”. According to Ringe (2006), however, all the extant modern 
Indo-European varieties descend from the non-Anatolian branch of Proto- 
Indo-European, which he calls North Indo-European. This branch represents a 
more innovative stage of Proto-Indo-European in which “a three-way contrast in 
grammatical gender” had developed (Fortson 2010: 114), it being widely agreed 
that the innovation which gave rise to the new tripartite system was the develop- 
ment of a new feminine gender. Then, as Dahl says (2004: 199), 


1. ‘The passage reads in full [in my English translation]: “In Bergen during the Hansa Period 
there was close contact between two very similar languages, Old Norse and Middle Low German. 
Both these languages had three genders and four cases on nouns. The result was a language, or 
rather a dialect, with two genders and no case inflections on nouns.” 


2. Iam very grateful indeed to Lars Gunnar Andersson, Torben Arboe, Christine Bolze, 
Magdalena Charzynska-Wojcik, Alida Chevalier, Grev Corbett, Gunther De Vogelaer, Martin 
Durrell, Tom Geddes, Jarich Hoekstra, Ernst Hakon Jahr, Jurij Kusmenko, Rajend Mesthrie, 
Agnete Nesse, Inge Lise Pedersen, Gro-Renée Rambo, Gertrud Reershemius. Jurg Schwyter, Wim 
Vandenbussche, Bert Vaux, Alastair Walker, Roland Willemyns, and Ilse Wischer for their help 
with and advice on this paper. I hope they will forgive me for having pestered them for informa- 
tion and comment. Jan Terje Faarlund also helped enormously, even though he didn't know it. 
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the natural assumption is that [the masculine and feminine] animate gender(s) 
[came] to be extended to inanimates through various processes, although the 
full story of how a system like e.g. that of French and German arises has not yet 
been told. 


So in North Indo-European “each noun was arbitrarily assigned to a concord class” 
= gender, and “there were three genders, conventionally called masculine, femi- 
nine, and neuter” (Ringe 2006: 24). The conventional labels “masculine” and 
“feminine” are of course due to the fact that nouns referring to male and female 
animates were still non-arbitrarily assigned to the these classes respectively. How 
exactly the new feminine gender developed is the subject of much discussion - as 
Corbett says, the origins of the Indo-European gender system “lie so far back” in 
time that much work on them “has been largely speculative” (1999: 168) - but 
Luraghi (2009), and Miller (1977, 2010) do provide helpful accounts. 


Totenmalet 


It is this feminine gender that the present paper is most particularly focussed on. 
Many modern Indo-European languages still preserve the three genders. The 
Germanic dialect Totenmialet? (Faarlund 2000) has a grammatical gender system 
which still shows the faithful preservation, over a period of at least 6,000 years, of 
all three North Proto-Indo-European genders. 

The gender system of Totenmalet, though nothing like as pervasive as that of 
some other modern Indo-European languages such as Polish or Greek, distin- 
guishes between masculine, feminine and neuter in a number of important nomi- 
nal categories to the extent that gender markers occur frequently in all utterances 
of any length. Faarlund’s grammar (2000) shows that the categories involved are 
rather numerous. They include: 


indefinite articles: ‘a, an’ 
MASC FEM NEUT 
einn ei eitt 


postposed definite articles: ‘the’ 


MASC FEM NEUT 
NOM. SG -en -a -e 
DAT.SG -a -en -i 
NOM.PL -a -en -a 


3.  Totenmalet is the Indo-European North Germanic variety spoken in the Norwegian dis- 
tricts of Eastern and Western Toten and the immediately surrounding areas, 60+ miles/100+ kms 
to the north of Oslo and to the west of the large lake, Mjosa. 
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the adjective ‘little’ (indefinite): 
MASC FEM NEUT 
litn lita lite 


possessive pronouns: 


MASC FEM NEUT 
Ist sG minn mi mitt ‘my 
2ndsc = dinn di ditt ‘your 
3rd REFL_ sinn si sitt ‘his, her, its’ reflexive’ 
‘OWN’ eien eia eie ‘own as in ‘my own etc. 


NEG. SG. INDEF. pronoun: no 


MASC FEM NEUT 
ingen inga ittno 
3rd sG. pronouns: 
MASC FEM NEUT 
NOM. heenn hu dee ‘he; ‘she; ‘it’ 
OBL hennom henner de ‘him, ‘her’, ‘it 


Crucially, although the third-person singular pronouns henn ‘he’ and hu ‘she’ are 
generally used to pronominalise human and other animates according to their 
natural gender, i.e. sex, the system as a whole, as in older Indo-European as well as 
in modern varieties such as Standard Norwegian Nynorsk and German, still oper- 
ates on an entirely grammatical-gender basis,: 


the wall ... it vegg-en...heenn  ‘wall-the ... he’ 

the book ... it bok-a... hu ‘book-the ... she’ 

the child ... it barn-e ... de ‘child-the ... it 
Bergen 


It is interesting to note, therefore, that there are also many modern Indo-European 
varieties which have actually reduced the number of genders from three back to 
two again, precisely through the loss of the originally innovative feminine. 
Intriguingly, Totenmalet is spoken in some geographical proximity to such 
varieties — closely related Germanic varieties which have only two genders. For 
instance, the Norwegian dialect of the major city of Bergen, which is about 
180 miles/300 kms west of Toten, has acquired a system with only two genders, 
common and neuter. This reduction was effected through the loss of the newest of 
the three Indo-European genders, the feminine, which became merged with the 
masculine into a new ‘common gender’. Thus, in Bergen, the distinct feminine 


80 


Peter Trudgill 


definite and indefinite articles have disappeared and the markers of the common 
gender are all original masculines e.g. bok-en cf. Totenmalet bok-a ‘the book’, en 
bok cf. Totenmalet ei bok ‘a book. This is more than just a formal matter, since the 
originally feminine nouns are now also pronominalised using the original mascu- 
line pronoun han ‘he; corresponding to Totenmalet henn. The feminine third- 
person singular pronoun hon, corresponding to Totenmalet hu ‘she’ does survive, 
but is used only for natural gender i.e. to pronominalise feminine nouns referring 
to female humans. Thus in the basilectal Bergen dialect (Nesse 2005) we have: 


BERGEN DIALECT cf. TOTENMALET 

vegg-en ... han ‘wall-the ... he’ vegg-en... heenn ... ‘he’ 
bok-en ... han book-the... he’ — bok-a... hu .. She’ 
jent-en ... hon ‘girl-the ... she’ jint-a... hu ... she’ 


Very strikingly, in Bergen hon is not even used for female non-human animates: 
one says ku-en ... han ‘cow-the ... he. Hon is restricted to women and girls. Bergen 
thus has a female equivalent of the Polish “masculine personal” or “virile” gender 
which is confined to masculine humans.‘ 

This is not, however, the same situation as is found in other two-gender 
Scandinavian varieties. Standard Swedish (Skrzypek 2010) and Standard Danish 
have also merged feminine with masculine; but they do not have pronominalisa- 
tion of the type boken ... han. Rather, they have developed a newer system in which 
han and hun/hon are both reserved for animates, while inanimate common gender 
nouns (including original feminines) are pronominalised with the new pronoun 
den ‘it, derived historically from the masculine distal demonstrative den ‘that’ 
(see further below). 

There are a number of other Indo-European varieties which have only two 
genders. All the western Romance languages come into this category, but in these 
cases the reduction from three to two genders was achieved through the loss of the 
Latin neuter, not the feminine. Similarly in the two-gender dialect of Slovenian 
found in the bilingual area of Sele Fara, southern Austria (Priestly 1983), it is the 
neuter gender which has disappeared and been replaced by the masculine. And 
this is also true of Latvian, Lithuanian, Welsh, Gaelic, Hindi and Pashto. 

In this paper, however, I focus specifically on the loss of the feminine, not least 
because this is more perplexing than the loss of the neuter. Grammatical gender 
systems of the Indo-European type are based on “small systems of noun classes as 
in French or German” (Dixon 2002: 452) where, as we saw above, the class assign- 
ment of inanimates is arbitrary but the actual categories are derived from the 


4. This makes Bergen rather unusual in the European context; but elsewhere many Dravidian 
languages, for instance, have a “feminine personal” gender (Grev Cobett p.c). 
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natural gender of animates, i.e. they are semantically derived sex-based gender 
categories. It is therefore very odd, as Hickey (2000) points out, to find varieties 
such as modern Standard Danish and Standard Swedish which have actually lost 
the fundamental distinction in inanimates between masculine and feminine which 
the whole sex-based grammatical gender system is founded on. 

My goal here is to explore the phenomenon of feminine gender loss and main- 
tenance in West Germanic varieties such as Dutch, and in North Germanic 
varieties such as Totenmalet. Within the Germanic language family there are many 
varieties which, like Totenmalet, preserve the original three genders - Icelandic, 
for instance. On the other hand, there are varieties - English and Afrikaans - which 
have lost grammatical gender (as opposed to natural gender) altogether.° And 
there are yet others, like Bergen Norwegian and Standard Danish, which have lost 
the feminine and thus have only two genders.° 

My exploration will be carried out from the perspective of sociolinguistic ty- 
pology (Trudgill 2011a). Sociolinguistic typology is a form of linguistic typology 
which seeks to make links between social and linguistic structures: I will be asking 
whether it is possible to provide sociolinguistically-based explanations for why dif- 
ferent Germanic varieties vary to the extent of having respectively one, two, or three 
genders’ — where I am using the term “explanation” in what McMahon refers to as 
a “lower-key” kind of way, to refer to “relief from puzzlement about some phenom- 
enon” (1994: 45). Can we give a sensible sociolinguistic account of why Totenmalet 
has preserved the feminine gender while other Germanic varieties have not? What 
different social factors might be involved, separately or in combination? 

We can begin by considering feminine-gender loss in the North Germanic 
languages generally. Haugen (1976: 288) says that the three grammatical genders 
are preserved “in the overwhelming majority of Scandinavian dialects down to the 
present” — including Totenmalet of course, as well as all varieties of Icelandic and 
Faroese — but that certain varieties have reduced the number of genders to two. 
These latter varieties are the Norwegian of Bergen already mentioned, together 
with Standard Danish and Swedish, plus a number of Danish (Ringgaard 1974; 
Arboe 2008) and Swedish dialects. The feminine gender is most weakly preserved 
in contemporary Swedish in those varieties which are spoken in the vicinity of 
Stockholm, and in the (formerly Danish) dialects of Skane/Scania, the area nearest 


5. Other non-Germanic Indo-European languages which have lost gender are Cappadocian 
Greek (Janse 2009), Armenian, Farsi, and Bengali. 

6. ‘There is also one bizarre variety, Norwegian Bokmal, which could be said to have 
two-and-a-half genders. 

7. To write of varieties with “one gender” is obviously anomalous, but I notice that I have done 
it anyway. 
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to Copenhagen (Andersson 2011). It is now also being lost elsewhere: Sandstr6m 
(2010) for instance shows that the Finland-Swedish dialects of Nyland have been 
moving from a three-gender to a Standard Swedish-type two-gender system over 
the last hundred years. 

It is encouraging therefore to notice that sociolinguistic explanations have been 
advanced for this development: a number of writers have ascribed the simplifica- 
tion to language contact (e.g. Kusmenko 2000). For Bergen, Jahr (1995, 1998, 2001) 
has developed a contact-based explanation for the loss of the feminine in which he 
points out that the only other part of Norway where this distinction has been lost 
is in the area of strong Sami-Norwegian contact in the Norwegian North. He ar- 
gues that the Bergen development was the result of the “heavy influence of lan- 
guage contact between Norwegian and Low German” (2001: 100) during the Hanse 
period. Nesse (2002) makes the same kind of point in more detail. And Faarlund 
(2004) also agrees, as per the quotation cited at the beginning of this paper. 

This, however, turns out to be controversial. Perridon (2003: 254) strongly dis- 
putes these language-contact based arguments: “the hypothesis that language con- 
tact with Low German was (co)-responsible for a number of changes in the 
grammar of the dialect of Bergen does not explain anything” The traditional expla- 
nation for this development, which he favours, is simply phonological. In Bergen, 


after the quantity shift, there were no longer long segments in unstressed or lightly 
stressed syllables, the opposition between long and short n’s in the definite forms 
of nouns was hence lost: hestinn ‘horse DEF-sG-m’ became hesten, with the same 
ending as bokin ‘book DEF-sG-F’— boken. 


In other dialects, though, “short final -n fell, often after having nasalized the pre- 
ceding vowel [i], [z] or [e], which later turned into [a], e.g. solin > soli > sola 
‘sun-DEF, or soli in non-nasalizing dialects’, and so the gender distinction was pre- 
served. In Totenmalet, then, the modern feminine definite article -a is the result of 
the loss of final /n/ in -in, with concomitant nasalization of the preceding /i/, fol- 
lowed by the lowering of /i/ to /4/, and then denasalisation. So Perridon would 
seem to be correct to the extent that the merger of the masculine and feminine 
articles was linked to a phonological change. 

In Perridon’s disagreement with Jahr we can see a conflict between two oppos- 
ing types of attitude towards explanations for linguistic changes. On the one hand 
we see sociolinguists looking to language contact and other social factors as 
explanations, while on the other hand there are scholars who are part of a tradition 
of linguists who see no need for any explanation other than the linguistic-internal 
(see Delsing 2002). Some of these linguists go as far as to overtly question the role 
of sociolinguistic factors at all - Perridon is only one of a number of 
non-sociolinguists who are contact-sceptics who feel no need for a sociolinguistic 
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explanation for something which seems to them to be just a straightforward con- 
sequence of internally generated linguistic change. In the Bergen case, Perridon 
argues that it was phonological change that was the key; but a number of linguists 
have also argued for the role of morphology, e.g. Enger (2010). 

My own goal in this paper - to attempt to account for why gender has been re- 
duced in some varieties but not others — leads me to take the side of Jahr in this 
dispute. While Perridon’s account of the sound changes involved is undoubtedly cor- 
rect, what he does not attempt to do is to explain why the processes of phonological 
change led to the two different grammatical outcomes in the two different varieties. 

The big question is: why did sound change lead to the development of the loss 
of the feminine gender in precisely the Bergen variety, and to the preservation of 
the feminine in precisely (for instance) the Toten variety? As Kusmenko (2000) 
argues, to say that the merger was brought about by phonological change is surely 
to confuse cause and effect. What was it about the situation - I would say the so- 
ciolinguistic situation — in Bergen that led to the one development; and what was 
it about Toten that led to the other? Perhaps Perridon’s point of view would be that 
there is nothing specific about Bergen or Toten that is relevant here at all - that 
there is a random element in linguistic change which means that it is impossible to 
account for why a particular linguistic change happened when and where it did, 
rather than at some other time and in some other place. 

But the instinct of the sociolinguist is at least to try to move towards a further 
reduction in puzzlement. 


Copenhagen 


And, for the sociolinguist, there is something very obviously sociolinguis- 
tic-typological to be said about the Scandinavian loss of the feminine. Note 
Perridon’s own description (2003: 240): 


Masculine and feminine gender have coalesced into a common gender ... in 
Copenhagen, and hence in Standard Danish; in Central Sweden (the area around 
Stockholm; Uppland) and hence in Standard Swedish; and in Bergen. 


The denial of the role of contact means that for Perridon it has to be a total coinci- 
dence that three of the areas in which feminine-gender loss has occurred just hap- 
pened to be, at the relevant period in history, the largest city in Denmark, and the 
largest city in Sweden, and the largest city in Norway. 

From the point of view of sociolinguistic typology, however, this is actually 
most unlikely to be just a coincidence. Kusmenko (2000) makes the point that it 
cannot be coincidental that at the time of the supremacy of Low German, the 


84 Peter Trudgill 


language of the Hanseatic League, it was these three cities with their large popula- 
tions of foreigners which lost the feminine, while the immediately surrounding 
rural areas did not. (Eventually, as we just noted, the loss did spread into the envi- 
rons of Stockholm and Copenhagen - the dialects of Sealand outside Copenhagen 
merged masculine and feminine at the beginning of the 20th century - but in the 
case of the dialects around Bergen it has still not happened.) 

Pedersen (1999) writes that in Copenhagen the masculine-feminine distinc- 
tion was totally lost during the period 1450-1600. During this time Copenhagen 
was experiencing a five-fold increase in the number of its inhabitants; and koinei- 
sation as a result of the dialect contact which accompanies in-migration and dia- 
lect-mixture during such urbanisation is a very common occurrence (Trudgill 
1986). Pedersen has just such a koineisation explanation for gender merger in 
Copenhagen - and hence in Standard Danish: the gender merger was due to con- 
tact between the dialects of Sealand and (the now Swedish) Scania, as well as Low 
German, which would at that time perhaps have been somewhat mutually intelli- 
gible with Scandinavian (see below). 

From a mechanical linguistic perspective, it is relatively simple to see how 
gender loss resulted from dialect-levelling. For example, in the three varieties in- 
volved in the dialect contact, eight different forms would have been found for the 
first person singular possessive pronoun. All eight forms were phonologically very 
similar, but the masculine/feminine distinction was effected in different ways in 
the different varieties, and the situation was further complicated by the three dif- 
ferent case forms in Low German. 

The eventual new Copenhagen dialect outcome represented a kind of simpli- 
fied common-denominator compromise between the eight forms, in which the 
masculine-feminine distinction had gone missing: 


Dialect-mixture input: ‘my’ 


MASC FEM 

SEALAND min men 

SCANIA min mi:n 
Low GERMAN 

NOM min mi:ne 

ACC mi:nen  mi:ne 

DAT mi:nem  mi:ner 
Koine output: mi:n 


We can suppose that similar processes may have been at work in Stockholm and 
Bergen. 
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And there may also have been a more specific role for Low German here: 
Haugen (1976) suggests that the entry of new loanwords from Low German into 
the dialects of the big cities might have caused problems of gender assignment 
which contributed to the merger of masculine and feminine. 


Afrikaans 


In his anti-contact argument, Perridon refers in particular to the fact that 
Jahr - referring to the large-scale presence of Low German-speaking Hanseatic 
traders in Bergen — says that: “Siden voksne som lerer nye sprak, kjennetegnes ved 
a vere ‘imperfect adult learners’ i forhold til hva barn er, vil vi vente a finne flere og 
tydligere sprakkontaktresultater i Bergen enni Oslo og Tonsberg.”’ Jahr’s explanation, 
that is, has to do with the simplification which is well-known to be associated with 
the imperfect language learning abilities of adults in contact situations. 

Perridon (2003: 254) counters with the observation that it cannot have been 
like that because “in Belgium there was an intensive language contact between 
French and Dutch, but just in this area the three gender system is still very much 
alive’, and therefore a contact-based explanation for why gender loss occurred in 
Bergen does not make sense. 

It is particularly odd that Perridon has chosen Dutch as an example to make 
his anti-contact case here, because there is one variety of Dutch which is univer- 
sally accepted as having undergone simplification, including gender loss, precisely 
as a result of language contact (see Duke 2010). The Southern African language 
Afrikaans is derived historically from Dutch. And Roberge (1995) explains that it 
has undergone much simplification relative to its source language because of its 
history of contact with indigenous and other languages: there were “three groups 
primarily responsible for the formation of Afrikaans - European settlers 
(from 1652), the indigenous Khoekhoe, and enslaved peoples of Africa and Asia 
(from 1658)” (Roberge 1995: 68). 

As a striking example of this simplification, Afrikaans has lost not just the 
feminine gender but grammatical gender altogether. “Gender in the noun had vir- 
tually disappeared by 1797” (Roberge 1995: 78) - which was about 150 years after 
the first Dutch settlement. The modern Afrikaans system is now one of natural 
gender in which the definite article is die for all nouns, singular and plural, and the 
indefinite article is ‘1 [a] (Donaldson 1994) e.g.: 


8. “Compared to children, adults learning new languages are characterised as being “imperfect 
adult learners’, and we would therefore expect to find more - and more obvious - consequences 
of language contact in Bergen than in Oslo or Tonsberg,’ 
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die man ‘the man 
die vrouw ‘the woman’ 
die huis ‘the house’ 


Natural gender appears under pronominalisation: the corresponding third-person 
singular pronouns are man - hy (animate masculine ‘he’), vrouw - sy (animate 
feminine ‘she’), huis — dit (inanimate ‘it’). 

Typologically Afrikaans is best described as a creoloid (Trudgill 1983): 
although it has undergone simplification, there has been a continuous native- 
speaker tradition throughout its history, with the Dutch/Afrikaans language hav- 
ing been transmitted from one generation to another in the normal way since the 
arrival of Dutch speakers in the 1600s. Many modern Afrikaans native speakers 
are thus the direct genetic as well as cultural descendants of earlier generations of 
Dutch speakers. But according to Combrink (1978), the population of Cape Town 
in 1795 was about 1,500; and about 1000 of these were “slaves that had come from 
Guinea, Mozambique, and the far East” and “people of Khoin extraction, most of 
whom had given up their mother tongue and could speak only the Cape Dutch” 
(Combrink 1978: 77). Dutch was acquired by very large numbers of non-native 
speakers, initially speakers of the indigenous Khoi and San languages and subse- 
quently speakers of Bantu languages, plus immigrant speakers of German, French, 
the Scandinavian languages, Portuguese, Malagasy, Malay, Indonesian languages, 
and others. There was much racial mixing and intermarriage, as the current 
“Coloured” population of South Africa shows. Women in particular were quite 
likely to be native speakers of some language other than Dutch. This led to a situ- 
ation where the nature of transmission became crucial. So many people initially 
acquired and used Dutch as an L2 that, in later generations, children were born 
who became, as it were, native speakers of non-native Dutch. A non-native variety 
became nativised and in time the dominant, and subsequently sole, variety. 

Dialect contact was also important, as is often the case in colonial situations 
(Trudgill 2004a), and as it undoubtedly also was in Copenhagen. Combrink (1978) 
sees a large role for dialect contact in the simplification which occurred in Cape 
Town and ascribes this in part to “interdialectal noise deletion’. If we compare 
Afrikaans with the Scandinavian cases we have examined, however, one strong 
inference is that dialect contact was only a minor influence here. In Copenhagen, 
dialect contact led to the reduction of grammatical genders from three to two. In 
Afrikaans, on the other hand, what has happened has been a total Joss of gram- 
matical gender, suggesting that language contact can have much more severe con- 
sequences than dialect contact. 

The role of contact in the formation of Afrikaans, then, was clearly so massive 
that it would be foolish to ascribe the loss of gender to any other cause. And in 
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adult language contact situations, grammatical gender is very much a prime 
candidate for loss. Dahl points out that many linguistic phenomena are 
cross-linguistically dispensable, i.e. there are languages which manage very well 
without them - and gender is clearly one such. As Hickey (2000) says, “grammatical 
gender ... is largely semantically redundant”. Gender systems are also ‘mature phe- 
nomena’ which take a very long time to develop diachronically and which tend to 
be “most recalcitrant in second language learning” (Dahl 2004: 286). It is therefore 
not at all surprising that loss of gender “appears to be highly correlated with the 
degree of external contacts” (Dahl 2004: 200). 


Dutch/Flemish 


To be fair to Perridon, however, we should also consider in more detail his spe- 
cific point about the consequences of contact between Dutch and French in 
Europe. From the point of view of sociolinguistic-typology, though, the 
Germanic/ Romance border does not actually provide a good comparison with 
Bergen at all. What Perridon says about contact in Belgium is of course quite true; 
but there is a failure of sociolinguistic sensitivity and imagination here. When he 
attempts to shine light on the situation in Bergen by pointing out, as noted above, 
that “in Belgium there was an intensive language contact between French and 
Dutch’, Perridon is not comparing like with like. He is using the term “contact” in 
an unsubtle and undifferentiated way, and one which turns out to be somewhat 
naive from a sociolinguistic point of view. The fact is that we need to analyse the 
concept ‘contact’ in more detail. 

In our search for explanatory factors we need to distinguish sociolinguistically 
between two major types of language contact which have two very different lin- 
guistic outcomes. As I have argued in Trudgill (2011a), simplification in language 
contact, such as loss of gender, does not result from contact as such. It results from 
post-critical threshold learning (Lenneberg 1967), which is what Jahr is clearly 
talking about in his quotation above. Simplification will occur only to the extent 
that adult second-language learning dominates in short term-contact situations. 

The contact between French and Dutch that Perridon refers to is precisely the 
sort of contact where we would not expect to observe simplification at all 
(see Trudgill 2010a). The situation on the language frontier in what is now Belgium 
is very different from the situation in mediaeval Bergen, where contact involved Low 
German-speaking adults who were mostly short-term migrants. In Belgium the 
contact was and continues to be a rather stable, very many centuries-old contact 
environment involving whole communities, including children, rather than the 
“imperfect adult learners” that Jahr refers to. Ryckeboer (2002) speaks, for instance, 
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of “generations and generations of bilinguals” in the French/Flemish border contact 
zone. And Willemyns (2002) writes of the “mutual influence due to the extremely 
long duration of language contact” and of fluent bilingual speakers who practice 
code-switching. 

In French Flanders,’ the degree of intensity of the Romance-Germanic contact 
and the length of its duration can be judged from the extent to which grammatical 
influence has been mutually exercised by the local varieties of West Flemish and 
Picard on each other. In the Flemish varieties of France we find conjunctions such 
as puisque dat ‘since’ parce que ‘because, soit ‘that is’; and prepositions such as da 
force van ‘by dint of’ grace van ‘thanks to’ (Ryckeboer 2002).“The Flemish dialect 
in France also shows syntactic characteristics that reflect ... the age-old influence 
of French” including the “extra-position of some adverbial complements and even 
inherent complements’, which are “very common in French Flemish’, while 
“another syntactic feature of the French Flemish dialect is the almost complete 
absence of inversion (for 97%) after topicalisation of a non-subject constituent” 
(Ryckeboer 2002: 37). 

Similarly, according to Callebout & Ryckeboer (1997), the local Picard dialects 
demonstrate Germanic features such as preposed adjectives: du frais café ‘fresh 
coffee’; and the postposition of prepositions and adverbs as in crier aprés ‘shout 
after’, venir avec ‘come with’, faire sans ‘do without. Compounding can also occur 
in the Germanic manner, as in piedsente cf. French sentier pédestre and Dutch 
voetpad ‘footpath’. 

Perridon then continues his argument by making a contrast between Belgium 
and the northern part of the Dutch-speaking area. He writes that in Holland, on 
the other hand, there was little or no contact between Dutch and French (or other 
languages), yet the dialects in this area lost the distinction between feminine and 
masculine gender for reference to non-persons (2003: 254). 

It is of course quite true that the metropolitan varieties in the north of the 
Netherlandic-speaking area, including Standard Dutch, have merged feminine 
and masculine, while dialects of the Flemish south (and neighbouring areas) have 
retained the three original genders (De Vogelaer 2009). Standard Dutch has the 
definite article forms de (common gender - masc./fem.) and het (neut.), while the 
southern dialects have den (masc.), de (fem.), and ‘t (neut.), with corresponding 
differences of adjectival agreement (Taeldeman 2005: 62): 


‘the thin farmer ‘the thin farmer’s wife 


FLEMISH den dunnen boer de dunne boerin 
STANDARD DutTcH | de dunne boer de dunne boerin 
cr. AFRIKAANS die maer boer die maer boervrou 


9. The Dutch-speaking or formerly Dutch-speaking area of northern France, witness place- 
names such as Dunkerque and Hazebrouck. 
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Importantly, the effects of this merger can also be seen in grammatical (as opposed 
to natural) gender: 


“The table ... it’ 
FLEMISH De tafel ... ze (=‘she’) 
STANDARD DutcH De tafel... hij (="he’) 
CF. AFRIKAANS Die tafel ... dit (=‘it’) 


The noun tafel was originally feminine in all varieties of Dutch, and is therefore 
pronominalised by “she” in varieties which preserve the feminine. But in Standard 
Dutch, where the feminine has been lost, all common gender nouns regardless of 
historical gender status are now pronominalised by ‘he’ - except for animate femi- 
nines such as vrouw ‘woman’ - very much as in Bergen Norwegian (De Vogelaer 
& De Sutter 2011). 

Consider, though, how nicely this fact ties in with what I said above about 
urbanisation, dialect contact, koineisation, and simplification. Gender simplifica- 
tion in northern Dutch occurred during a period of rapid population expansion 
and immigration in the Golden Age (Geerts 1966; and see further Trudgill 2012) 
and subsequently, in the now highly urbanized north of the area, focusing on what 
is now known as the Randstad — the urban conglomeration with a modern popula- 
tion of over 7 million based on the four major cities of Amsterdam, The 
Hague/Den Haag, Rotterdam, and Utrecht; and including also Delft, Dordrecht, 
Hilversum, Harlem/Haarlem and Leiden, whence it has spread out into surround- 
ing rural areas without, so far, reaching the far south of the Dutch-speaking area 
(see more on spatial diffusion below). This, then, is another example of dialect 
contact leading to gender reduction rather than loss. 


High German 


I have to concede, however, that Perridon is not the only scholar to object to the 
contact-and-simplification thesis. Campbell and Poser (2008: 362) take a similar 
line to Perridon’s by citing the case of High German. ‘They discuss the approach 
taken to contact and simplification in e.g. Trudgill (2002), and pose a question 
which is similar to the one posed by Perridon for Bergen. They ask: if Trudgill is 
right about simplification being a consequence of language contact, then why has 
German, a language which has a long history of being learned as an L2, and being 
used as a lingua franca, not been simplified like the mainland Scandinavian lan- 
guages that Trudgill points to? 

It is certainly true that Standard German has retained the three Indo-European 
genders, and this applies too to the nonstandard dialects of central and southern 
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Germany, Switzerland, and Austria. It is also true that Campbell and Poser’s ques- 
tion needs answering. In my view, though, this is really not the right question. The 
question should not so much be why the acquisition of German by non-native 
speakers has not led to simplification. After all, in a normal situation where a con- 
tinuous native-speaker tradition is maintained (Thomason & Kaufmann 1988), 
why would it? 

When pidgins develop, this is because there is an intergenerational break in 
transmission, and the result is effectively a new language. But where native speak- 
ers transmit their language from one generation to another in the normal way, as 
with German, the fact that there may also be non-natives around who are speaking 
simplified versions of the language will normally have no effect on this transmis- 
sion whatsoever. No language is currently used more by non-native speakers than 
English, but the mainstream native varieties of English are currently showing no 
serious signs of large-scale, ongoing, new simplification. 

The real question is the reverse of Campbell and Poser’s question. Consider 
Bergen once again. It has been argued, as we saw above, that simplification oc- 
curred in Bergen as a result of contact with Low German speakers. But how did 
this work? How are we to unpack the phrase “as a result of”? It would be no sur- 
prise if the Norwegian spoken by native speakers of Low German was simplified, 
as undoubtedly it was: Wessén (1970: 169) writes “mit der Zeit gingen die 
Deutschen zwar dazu tiber, Nordisch zu sprechen, aber sie vermochten nie, den 
richtigen Gebrauch der nordischen Beugungsformen zu lernen”!°? But the big 
question is the one Campbell and Poser did not ask: why should this simplifica- 
tion have spread to the speech of native Norwegians, and thus to subsequent 
generations? 

Again, the focus has to be narrowed onto type of contact — it is not adequate to 
refer to “contact” as an undifferentiated concept. And the clue here lies in the case 
of Afrikaans, where inter-generational transmission did continue, but where as we 
saw above the situation was hardly “normal” because of the numbers of speakers 
of different languages involved. The answer to the Bergen question must lie, as 
with Afrikaans, in two factors central to contact processes: demography and trans- 
mission. According to Rambo (2010), citing Helle (1982), the population of Bergen 
in the 1400s may have been around 7,000. Of these perhaps 2,000 were Germans, 
with that number rising to 4,000 or so (then out of 9,000) in the summer months. 
There were also other foreigners, including Dutch, English and Scottish. Else- 
where, Blomkvist (1979) says that in the 100 years from 1250, between a third and 
a half of the population of Kalmar, Sweden, were German. And Dahlback (1998) 


10. “As time went by, the Germans did go over to speaking Nordic, but they never managed to 
learn the correct usage of the Nordic inflectional forms.” 
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shows that in Stockholm two-thirds of the taxpayers in 1460 were German, though 
their proportion in the population overall was doubtless lower than that. 

My suggestion is that when the proportion of non-native speakers becomes as 
close to 50% as this - the proportion of non-native speakers of Norwegian in 
Bergen was certainly very high (maybe 4,000 out of 9,000) - the number of face- 
to-face dialect-contact interactions, and therefore potential instances of accom- 
modation (Trudgill 1986), would have reached a threshold level at which some 
aspects of the non-native variety could transfer to the native. If the demography is 
right, natives can and will accommodate, to different extents, to non-natives. And 
the point about “different extents” is important: overall simplification in Bergen 
Norwegian is actually very slight, especially when compared to Afrikaans, al- 
though it does extend to features other than gender-loss e.g. adjectives do not re- 
ceive a plural ending when used predicatively - vi er trott ‘we are tired’ as opposed 
to plural trotte in other dialects (Jahr 1998). 

And note that although there was a very long-term presence of these foreign 
communities in Bergen and other Scandinavian cities, the non-Norwegians did 
not typically settle, as individuals, for the long-term. They formed large but pre- 
dominantly male and, unlike in South Africa, transient populations. In short-term 
contact, which is what this must have been for individual adults, we know that 
dialect mixture and levelling occur as a result of contact via accommodation in 
face-to face interaction (Trudgill 1986). In the Bergen case, if Low German and 
Norwegian were mutually intelligible (Duke 2010), as they may well have been at 
this time - Nesse (2005) says that a normal communication pattern was one of 
passive bilingualism — we can count Low German/Norwegian interactions as in- 
stances of dialect contact, which tallies nicely with the fact that it was gender 
reduction and not loss which occurred. In-migration from rural areas of western 
Norway involving dialect contact will have been important too. 


Frisian and low German 


Campbell and Poser’s observation that High German, like Totenmialet, has re- 
tained the original three Indo-European genders is thought-provoking for a socio- 
linguist who is considering the interplay of contact and simplification, and who 
notes that the closely related northern Dutch has not retained this system. What 
can we make of the fact that, along the West Germanic dialect continuum, it is the 
dialects of central and upland southern Germany, and the Alpine German variet- 
ies of Switzerland and Austria, which have preserved the three genders, while it is 
dialects of the coastal lowlands of the Netherlands which have lost the feminine? 
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This question seems to become even more relevant when we notice that this 
observation about coastal areas does not just apply to Dutch. If we travel north- 
wards along the North Sea coast out of the Dutch-speaking area, we come into 
West Friesland; and it is intriguing to note that West Frisian follows metropolitan 
northern Dutch in also having the syncretism of masculine and feminine genders. 
Old Frisian!! did have the original three Germanic genders (Bremmer 2009) — as 
seen for example in the nominative singular definite articles thi, thiu, thet. But 
Modern West Frisian has only two genders, as a result of the loss of the feminine 
(Tiersma 1985: 47). 

The only West Frisian exception is the island dialect of Schiermonnikoog, 
where the feminine survives (Hoekstra 2001a). Wahrig-Burfeind (1989) finds this 
exception significant, and has a sociolinguistic explanation for this conservatism 
in terms of the relative isolation of the island community. In fact, in her extraordi- 
narily detailed study of the distribution of gender reduction in the southern North 
Sea region generally, she stresses the overall importance of contact, isolation and 
spatial diffusion. 

This brings us, then, to consider another important sociolinguistic factor in 
addition to contact-and-simplification, namely differential accessibility to the 
geographical diffusion of linguistic innovations (Trudgill 1973). Wahrig-Burfeind 
talks of “das Phanomen der arealen Gerichtetheit des Genussynkretismus” 
(1989: 295) - there is a spatial directionality to the spread of the change - and 
shows that the loss of gender is an innovation which has spread from one area to 
another even across language boundaries. The sociolinguist can further point out 
that this geographical diffusion will be the result of contact of the particular long- 
term type that comes with co-territoriality or at least proximity. 

The importance of diffusion as a factor is borne out by what we find if we con- 
tinue our journey eastwards along the North Sea coast from West Friesland and 
enter the Low German area. It is initially discouraging to find that according to the 
three chapters on Low German published in Russ (1990) - Goltz & Walker (1990), 
Durrell (1990), and Schonfeld (1990) - Low German has three genders, just like 
High German, something which is confirmed by the Low German grammar of 
Lindow et al. (1998). 

Crucially, though, these works do show that manifestations of gender are less 
numerous than in High German, there being a much smaller role for case in mod- 
ern Low German. The masculine/feminine distinction in particular survives only 
rather tenuously. As described by Durrell, manifestations of the masculine/feminine 
distinction in Westphalian are few, with the nominative singular of the definite ar- 
ticle being de for both genders, and only the oblique (the result of an earlier 


11. Extant texts in Old Frisian date from no earlier than the 14th century. 
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simplifying coalescence of accusative and dative) differing in having den for the 
masculine and de for feminine (cf. neuter dat). Lindow et al. (1998: 143) concur that 
it is only in the oblique singular case that the masculine and feminine definite arti- 
cle are regularly distinguished in Low German generally. Adjectives, in some variet- 
ies only, may also take an -n in the masculine oblique case in the strong declension. 
And demonstratives, possessives and indefinite articles may distinguish between 
masculine and feminine, though in many dialects there is no distinction at all. 

That is, in much of Low German, it looks as if all it would take for the total 
formal merger of masculine and feminine to occur would be for a change to hap- 
pen such that the word-final -n of the masculine singular oblique definite article 
was lost (and for the same change to happen in the case of strong adjectives in 
those varieties where they are also distinct). 

Importantly for our purposes, though this is not mentioned in the major refer- 
ence works, it actually turns out that this loss of -n is precisely what has in fact 
happened in a number of different varieties of Low German - though of course for 
a total gender merger to take place, loss of -n was not all that had to happen 
(cf. Nesse 2005): we can only be confident that native-speaker awareness of mas- 
culine vs feminine gender membership has been lost when the relevant pronouns 
and other proforms have ceased to be distinct as well. 

Here a comparison between Bergen and Low German is useful. When discuss- 
ing feminine-loss in Bergen above, I actually presented a rather simplified picture 
of the diachronic process. What Nesse in fact shows is that, even after the original 
merger of the masculine and feminine definite articles, the anaphoric pronouns 
used for masculine and feminine inanimate nouns did remain distinct - han and 
hon - for a period. Awareness of the distinctness of the two genders is signalled in 
the writings of Mester Absalon”? (1528-1575) in his etymologically-correct usage 
of both the pronouns hon and han for inanimates (Nesse 2002). It was only later, 
during the 1600s, that hon ‘she’ stopped being used to pronominalise original in- 
animate feminines and replaced by han ‘he’ - and it was only when this occurred 
that the feminine grammatical gender was completely lost. So, in Low German 
also, anaphoric pronouns will have to be considered. 

Happily for our hoped-for insight about the role of spatial diffusion, all the Low 
German varieties with two-gender systems turn out to be coastal. If on our east- 
wards coastal journey we leave West Friesland and enter the Low Saxon dialect re- 
gion of the Groningen area of the Netherlands, it emerges that this does indeed have 
the masculine-feminine syncretism (Wahrig-Burfeind 1989: 103). And then further 


12. Absalon Pedersson Beyer “Mester [Master] Absalon” was a Bergen priest, teacher, and 
writer whose diaries 1552-1572 provide invaluable historical data (Skard 1972: 32f). 
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east, on the other side of the German border, Wahrig-Burfeind says that in Emsland 
the “Synkretismus von Maskulinum und Femininum dominiert” (1989: 57). 
Travelling on, the loss of the feminine has also been totally carried through in 
East Frisia, where contact-induced language shift (see further below) from Frisian 
to Low German occurred some centuries ago. Wahrig-Burfeind (1989: 82-83) 
discusses the merger in East Frisian Low German [EFLG] with data from 
Wiesenhann (1977); and the merger is confirmed in the book-length study of East 
Frisian Low German [EFLG] by Matras and Reershemius (2003: 18) who write: 


EFLG, like other Low German varieties, has a two-gender system: common 
(or non-neuter) and neuter. As in Dutch and the Scandinavian languages, com- 
mon nouns tend to continue historical masculines and feminines, while neuter 
nouns generally continue historical neuters. 


Importantly also, the linguistic details do indicate a truly total gender merger, a la 
Bergen. According to Matras and Reershemius (2003), in East Frisian Low German 
the pronoun zdi ‘she’ is reserved for female animates only. Wiesenhann (1977), as 
reported in Wahrig-Burfeind (1989), also agrees that East Frisian Low German no 
longer uses the feminine pronoun for non-animate referents. As in Standard 
Danish and Swedish - and unlike in Bergen and Dutch - inanimates are pronom- 
inalised by a originally demonstrative pronoun, common ddi or neuter dat, though 
ddi can also used for animates in certain discourse contexts (see Matras & 
Reershemius 2003: 23 for details). 

And then further east again, Wahrig-Burfeind says of the Low German of the 
Oldenburg area that it too lacks a distinct feminine (1989: 57). 

So geography is clearly significant. To the sociolinguist, the fact that a long 
stretch of coastline - from the Rhine delta in the southern Netherlands to the 
mouth of the Weser in northern Germany - is occupied by varieties of three dif- 
ferent languages which have all lost the masculine-feminine distinction, while 
inland Low German and the isolated island West Frisian of Schiermonnikoog have 
not, looks very much as if it is not a coincidence. 


Relief from puzzlement? 


Wahrig-Burfeind (1989) points out that there is a regular linguistic trajectory of 
change which gender loss may pass through. The evidence I have presented so far 
indicates that the trajectory starts from the baseline of the three-gender situation 
where pronouns corresponding to ‘he, ‘she’ and ‘it’ are employed purely on the 
basis of the grammatical gender of the nominal concerned, as still happens in con- 
servative varieties such as Totenmalet and High German. The first actual 
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feminine-loss development chronologically speaking is one where, as in the 16th- 
century Bergen Norwegian of Mester Absalon, feminine nominals are no longer 
formally distinct from the masculine, as a result e.g. of the loss of the distinction 
between masculine and feminine articles, but ‘she’ is still used to pronominalise 
grammatical as well as natural feminines. 

‘The next stage is where ‘he’ takes over in the case of former feminines, except 
for animates, so that the grammatical-gender masculine-feminine distinction is 
lost altogether, as described above for Bergen and Northern Dutch. This is the 
crucial stage for gender merger. 

A further stage is illustrated by the modern Bergen dialect. Although han ‘he’ 
is still the pronoun used for all common inanimate nouns (and indeed for all ani- 
mates except humans), it is subject to a grammatical constraint that I have so far 
not mentioned, such that den, originally ‘that; has to be used instead when the 
pronoun is fronted or stressed (Nesse 2005). This is also what has happened in 
northern Dutch (Shetter 1994: 68), where although hij is used for all common 
gender inanimates, die, also originally ‘that, has to be used with fronting or stress. 
The usage of ddi ‘that’ in East Frisian Low German is not dissimilar (Matras & 
Reershemius 2003).!? Here we can see the beginnings of the next stage, as repre- 
sented in the current Standard Danish/ Standard Swedish systems, and note how 
they came into being diachronically. Recall that in these systems, han ‘he’ has now 
dropped out of use altogether except for male animates, while den ‘that’ has taken 
over and is used for all originally masculine and feminine inanimates in all con- 
texts, not just when fronted or stressed. This change is said to have been completed 
in Standard Swedish during the 1700s (Davidson 1990). 

It also seems that a further step may follow which can lead to the loss of gram- 
matical gender altogether. According to De Vogelaer & De Sutter (2011), a process 
is currently taking place in modern Dutch such that non-animate masculines and 
feminines are increasingly being treated as neuters, something which Curzan (2003) 
says also happened in Old English/Middle English, where grammatical gender even- 
tually disappeared altogether to be replaced by a purely natural gender system. '4 

The question still remains, however, as to why some varieties have travelled all 
the way along this linguistic trajectory, while others have travelled some distance 
only, and yet others have not even started out. Let us consider the potential ex- 
planatory factors. 


13. A similar development has occurred in the (albeit genderless) East Anglian dialects of 
English (see Trudgill 2003), where that is used rather than it in all except unstressed positions: 
That's raining, is it? 

14. This is not to say that other routes are not also possible. A number of modern English dia- 
lects can use he and she for inanimates (Siemund 2007), but according to semantic rather than 
grammatical criteria (see below). 
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Contact and simplification 


Jahr and other sociolinguistic colleagues, e.g. Kusters (2003), show that only socio- 
linguistic factors have any real chance of succeeding in accounting for why gender 
loss occurred in, say, coastal Low German but not in inland Low German, in 
northern Dutch but not in Flemish, and in Bergen but not in Totenmalet. 

Sociolinguists (and others - see Enger (2010) and earlier references cited 
there) see a very clear role for short-term adult contact in the production of sim- 
plification, with gender loss representing a clear case of simplification. The drastic 
nature of the total loss of gender in English and Afrikaans suggests that in these 
two cases it was mainly language contact that was involved: in both cases, the lan- 
guage contact resulted from migration. A similar point can be made for gender 
loss in former Sami-speaking areas, as mentioned by Jahr (1995), and for those 
Finland-Swedish dialects which have lost gender (Kusmenko 2000). 

However, in the case of those varieties in which we see less dramatic simplifi- 
cation, where grammatical gender has not been lost but merely reduced from three 
genders to two, it seems more likely that it was dialect contact that was principally 
relevant, especially if we can regard the Germanic languages at the period in ques- 
tion as all being sufficiently linguistically close to one another that labelling this as 
dialect contact would not be too great a distortion of the facts. I have suggested 
that in the case of Bergen, Stockholm, Copenhagen and the Dutch Randstad it is 
not a coincidence that, at the time of the linguistic changes, these were all large and 
rapidly growing urban areas where dialect contact and therefore koineisation were 
taking place. In the Scandinavian cases, too, contact with Low German was cer- 
tainly involved, with the suggestion being that this is best regarded as dialect-con- 
tact too (Trudgill 2000). 

A somewhat different perspective on contact-and-simplification comes from 
the Danish dialects of West Jutland, which have developed a radically different 
system of semantically/grammatically-determined count vs. mass noun classifica- 
tion (Arboe 2008): the original merged common masculine-plus-feminine gender 
has come to be used as the nominal class for count nouns, while mass nouns be- 
have as neuters. This West Jutish system can certainly be described as 
simplification, since assignment of nouns to classes is no longer arbitrary; and it 
has also been ascribed by Kusmenko (1996, 2000) to contact. He suggests specifi- 
cally that the simplification is due to the large-scale confusion about arbitrary 
masculine/ feminine/neuter gender membership which resulted from contact be- 
tween, and multilingualism in, Jutish Danish, North Frisian and Schleswig Low 
German, where nouns belonged to different genders in the different languages. 
(Indeed, Kusmenko (2000) has a similar explanation for the pattern of simplifica- 
tion involving northern Dutch, West Frisian, and Low German: it is certainly the 
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case that “there are a good many Frisian nouns whose gender differs from that of 
their Dutch cognates” (Visser 2011: 32).) 

However, there may be even more to the role of contact in producing the West 
Jutish system than that. The neighbouring North Frisian language (and certain 
Low German dialects in the same area) have a system which may be a precursor to 
the West Jutish system and indeed maybe responsible for it. Lofstedt (1968: 12) 
tells us that in modern North Frisian certain original inanimate masculines and 
feminine have become neuter, while others have become neuter only when used as 
mass nouns and retain their original gender when used as count nouns. Alastair 
Walker (p.c.) kindly supplies an example: di kafee (masculine) is e.g. a particular 
cup of coffee, whereas dat kafee (neuter) is coffee as a substance.!° 


Contact and language shift 


Contact can also have consequences of another type. Wahrig-Burfeind 
(1989: 75-76), using data from Bock (1933), has shown that another place where 
the masculine/feminine merger has been totally carried through is in the Low 
German of Schleswig, where there is also a pronoun system which operates with 
‘he’ and ‘she’ for animates but has a different common-gender pronoun for 
non-animates. History and geography suggest a role for contact here too: the mod- 
ern two-gender Schleswig Low German variety has been in contact with the two- 
gender dialects of Danish South Jutland (Arboe 2008; Kusmenko 1996) for many 
centuries. And an important factor here is likely to have been the large-scale and 
rather rapid language shift which occurred as speakers of South Jutish abandoned 
their North Germanic mother tongue and shifted to Low German in the 19th cen- 
tury as a result of political developments and border changes - a point alluded to 
in the title of Bock (1933), “Niederdeutsch auf danischem Substrat *, and also dis- 
cussed in Hoekstra (2009). The two-gender system was simply transferred intact 
from Danish to Low German. 

As we saw above, EFLG is also spoken in an area which was Frisian speaking 
until the 1400s, when the Hanseatic League became dominant, and Frisian re- 
mained as a minority language there until the 1500s (Matras & Reershemius 2003). 
So this variety of Low German too can be regarded as a substrate-influenced vari- 
ety resulting from contact-induced language shift. 


15. Fora similar development in English dialects, see Paddock (1991), Wagner (2005), Siemund 
(2007). 
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Contact and geographical diffusion 


Finally, as Wahrig-Burfeind’s work indicates, just looking at a map shows that the 
spatial pattern of feminine loss in modern Germanic is not random. ‘There is a 
large area of Europe stretching from the Rhine delta along the North Sea coast and 
up to and including much of Denmark, with a small break only between East Frisia 
and Schleswig, where the local dialects have the two-gender innovation. This looks 
very much indeed as if it is the result of the geographical diffusion of an innovation 
out of one or a number of centres, in the familiar pattern known to dialectologists 
and geolinguists, with, as is also well-known to students of spatial diffusion, a 
number of outlying urban exclaves to which the innovation has perhaps jumped, 
such as Bergen and Stockholm (Trudgill 1973; Chambers & Trudgill 1998). 

What exactly the route or routes of diffusion were, and what exactly the direc- 
tion of the diffusion was, still remains, I believe, to be investigated. Wahrig-Burfeind 
(1989) points to two geographical kernel areas which seem to her to have led the 
way in gender loss: English, where grammatical gender has disappeared altogether; 
and West Jutish, where it has been replaced by a radically different system. The fact 
that the two different innovating masculine-feminine syncretism areas — Schleswig 
Low German/Jutland Danish; and East Frisian Low German/West Frisian/Dutch 
— are separated from one another by a small conservative three-gender zone involv- 
ing the Low German of the Bremen area and Holstein, opens up the possibility that 
this intermediate zone represents the current limits of the diffusion of gender loss 
outwards from these two kernel areas. Mainland North Frisian is also part of the 
intervening conservative three-gender area. In the island dialects of North Frisian, 
however, the feminine has been lost — as the definite article forms show:'® 

The definite article in Island North Frisian dialects 


MASC FEM NEUT PL 


SYLT di di dit di 
HELGOLAND- de de deat de 
FOHR de det det do 
AMRUM di det det jo 


Note the remarkable fact, however, that on the islands of Fohr and Amrum the 
feminine has been lost through merger not with the masculine but with the 
neuter,” and so these dialects do not perhaps belong to the same wave of 
innovation. 


16. Data very kindly supplied by Alastair Walker. 

17. ‘This is explained by Hoekstra (2001a: 777) as being a consequence of the fact that “women 
came to be addressed with the neuter pronouns” and that the feminine definite article and per- 
sonal pronoun happened to be homophonous - for a fuller account, see Hoekstra (1996). 
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I personally doubt the role of English as a kernel area, for reasons I outline be- 
low. A more likely alternative is that it was the northern Dutch Randstad that was 
the original core area, with the three-gender southern coastal area of West Flanders 
representing the current limits of the southward expansion out of this area. 

‘The overall picture, then, is one where, in regions around the North Sea and 
the Baltic, inland communities have been relatively immune to the spread of the 
gender simplification which has affected the North Sea coast from the Rhine Delta 
up to Western Jutland and beyond. This geographical distribution suggests that 
the change was spread as a result of maritime, notably trading contacts, especially 
if we also recall that Bergen, Copenhagen and Stockholm have a long history of 
being maritime trading ports. 

The sociolinguistic pattern is a complex one where long-term contact between 
geographically neighbouring communities can lead to the diffusion of linguistic 
innovations gradually outwards from areas where they have already become estab- 
lished; but at the same time, long-distance contact between urban centres as a 
result of communication and trading patterns can lead to the development of 
exclaves such as Bergen and Stockholm. 

The case for the role of diffusion is made stronger, too, when we note the dat- 
ing: the gender merger probably began during the 1500s and became consolidated 
in the 1600s in the northern Netherlands (Geerts 1966), and in Bergen 
(Nesse 2005), and in Copenhagen (Pedersen 1999). The date for the first available 
texts showing the merger in West Frisian is not until the 1600s (Hoekstra 2001b), 
but we do lack texts from the late 1500s. 


English 


As just noted, Wahrig-Burfeind (1989) points to England as being one of the ker- 
nel areas from which diffusion of gender loss may have spread. Since English was 
the first Germanic variety to start down the path of gender loss, we cannot expect 
to find geographical diffusion from elsewhere bringing the change to England. 
Explanations therefore have to focus on England itself; and it must surely be sig- 
nificant that simplification took place in the grammatical structure of late Old 
English generally, not just in terms of gender. Old English was a highly synthetic 
inflecting language, while Middle English was less fusional, showing a clear move 
towards a more isolating type of morphology. 

Many authors have argued that this overall simplification was due at least in 
part to adult language contact. Milroy says of the simplification that “it seems 
clear that such a sweeping change is at least to some extent associated with 
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language contact” (1992: 203). The traditional view has been that it was contact 
with Old Norse that brought about the reduction in complexity; but there are 
now also many advocates of the view that it was contact with Celtic Brittonic 
which was responsible. It seems likely, in fact, that both were involved (Trudgill 
2010b, 2011a, 201b). 

What is not so clear is the degree to which contact can be accepted as having 
been involved specifically in bringing about gender loss. For one thing, this was 
certainly not something which happened relatively rapidly, as with Afrikaans. 
According to Jones (1988) it was a process that was ongoing for as much as 
300 years, from about 950 to 1250 - compare this with the 150 years or less in the 
case of Afrikaans. And other non-contact explanations are widely accepted. Lass 
for example, like Perridon, does see a role for phonological change, including in 
this case the weakening and loss of final unstressed syllables, as well as gram- 
matical and semantic developments. But he attributes the loss of gender mainly 
to “a shift towards a system in which sex (or the lack of it) became the primary 
or sole determinant” as well as “gradual relaxation of grammatical constraints” 
(2000: 106-107), and the resolution of conflicts between natural and grammati- 
cal gender in favour of the natural - as in the case of the neuter noun cild ‘child’ 
referring to a boy being treated as a masculine. Lass’s view is in some measure 
supported by Curzan’s (2003) analysis, mentioned above, which showed that 
masculine and feminine inanimates were increasingly treated as neuters during 
the relevant period. 

But this is not really an explanation. It is more of a description of what hap- 
pened which does not help us to understand why this process took place - nor, 
crucially, why it took place in English and not in the other early mediaeval 
Germanic languages. 

And again the sociolinguist does see two clues that contact can once again be 
invoked as an explanatory factor in gender loss. First, Jones (1998) disputes Lass’s 
interpretation. According to him, texts from the relevant period show many in- 
stances of unetymological gender assignment which do not involve reassignment 
according to natural gender. What we see in the texts can be interpreted, he says, 
simply as increasing uncertainty about which noun had which gender - which 
reminds us of Kusmenko’s (2000) argument about gender confusion and simplifi- 
cation on the continent. 

Secondly, Lass points out - and other writers on this topic are agreed 
(see e.g. Bolze 2007) - that “like many other structural simplifications in English, 
gender loss began in the north” (Lass 2000: 107). The gender-loss process can be 
seen to be under way in Old English in texts from the 900s in the north of the 
Old English-speaking area, and seems to have spread south and been complete 
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everywhere except, significantly, Kent in the far southeast of England by 1300. 
(Here again we can see geographical diffusion at work.) 

White (2002, 2003) has argued that the general inflectional attrition which 
occurred as a consequence of Late British/Old English contact, especially in the 
north, was later reinforced there by contact with Old Norse. This meant that the 
north experienced a double dose of language contact, a fact which he suggests 
explains why this part of England was by far the most innovating region in the late 
Old English/early Middle English period. White argues that the English Southeast 
was the most conservative area because it experienced significant contact with 
neither Old Norse nor Late British, while East Anglia and the Southwest were both 
intermediate in terms of degree of change because they experienced only one con- 
tact phase each - with Old Norse and Late British respectively. 

We can also safely assume that in actual speech the gender-loss changes began 
considerably prior to the dates just given for when the changes began to surface in 
extant written records. This timing suggests that gender loss in English is unlikely 
to have been part of the same wave of simplification as the one which occurred 
centuries later on the continent, in spite of Wahrig-Burfeind’s suggestion. Coupled 
with the difference in linguistic outcomes, this makes it look as if the English de- 
velopment of gender loss was not part of the same process as the one which took 
place later in the Netherlands, Germany and Scandinavia. 


Totenmalet again 


English was not only the first Germanic variety in which gender loss began to be 
manifested, it has also gone further down this track than any other variety except 
for Afrikaans. In terms of the trajectory of change as outlined above, English rep- 
resents the last stage of development where gender merger and loss have gone to 
completion. Bergen Norwegian has not travelled as far as English down this track, 
but amongst varieties of Norwegian, as the only local Norwegian dialect which has 
totally lost the feminine grammatical gender, it has gone the furthest. And 
Totenmalet represents the earliest possible stage, in which no step in the direction 
of gender reduction has yet been taken at all, a position it shares with Icelandic, 
Faroese, High German and a number of other Scandinavian, particularly 
Norwegian, varieties. 

In view of the forgoing discussion, this grammatical conservatism of 
Totenmalet would now seem to be rather uncontroversially accounted for by 
sociolinguistic factors. The correct analysis would appear to be that this rural in- 
land dialect has been little affected by contact and simplification resulting from 


102 Peter Trudgill 


immigration or urbanisation; that it has not been involved in language shift; and 
that it has also been, at a distance of almost 100kms/60 miles from the nearest 
maritime port, relatively immune to the diffusion of the innovation of gender sim- 
plification as spread by coastal trade. 

This conservatism is not only demonstrated in the preservation of the Proto- 
Indo-European grammatical gender system. Totenmalet has also preserved the 
dative case, which has long since gone missing from urban and coastal dialects of 
Norwegian — in many Norwegian varieties it had already started disappearing be- 
fore 1350 (Skard 1967: 95). (The dative has also gone from many varieties of 
Swedish; and all varieties of Danish, English, Dutch and Low German.) In this, 
Totenmalet again resembles those most conservative of Germanic varieties, the 
Insular Scandinavian languages Icelandic and Faroese, and High German. 

In attempting to account for differential gender loss in Germanic, we have 
noted that internal linguistic change may have gender merger as a consequence; 
but that this cannot at all explain why merger has occurred in some places but not 
others. For those varieties which do demonstrate gender reduction, Malkiel’s 
(1967) insight about multiple causation or “pluricausality” is very welcome. There 
are so many different things that can happen in linguistic change that, for some 
particular change actually to happen, in some particular place at some particular 
time, it has to have a number of different factors working in its favour, in that place 
and at that time. It makes good sense to suppose that gender reduction took place 
in Bergen as a result of simplification due to adult foreign language-learning in the 
language/dialect-contact situation; and language shift by Hanseatic Germans to 
Norwegian; and Norwegian dialect mixture; and the diffusion of the innovation 
from centres further south on the European continent. 

In Toten, gender reduction simply had many fewer factors going for it. 
Two-gender Bergen Norwegian developed in a city which had considerable and 
large-scale maritime trade with northern Germany, Holland, England and else- 
where; large-scale immigration from different Low German-speaking Hanseatic 
towns; the presence for many centuries of communities of ship-borne foreigners; 
and considerable experience of urbanisation as a result of in-migration from sur- 
rounding areas of western Norway. The three-gender dialect area of Toten experi- 
enced none of those factors - something which I suspect current speakers of the 
Toten dialect will have few regrets about. 
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Non-finite adjuncts in French 


Hans Petter Helland 
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In this paper, I discuss the morphological build-up of the French gérondif en 

+ V-ant and compare it to the present participle V-ant. On the basis of this 
discussion, en is to be treated as a prepositional complementizer just as de and 

a with infinitives. Arguments are given for a recategorization process of en from 
preposition to complementizer. The analysis is supported by facts from binding, 
agreement and cliticization. This means that the French gérondif, besides being 
verbal in nature, enters into clausal constructions just like the present participles. 
This conclusion enables us to revise some of the arguments in the literature for 
the gérondif as a mono-morphemic structure. 


1. Introduction 


It is a well-known fact from the descriptive literature that French has two compet- 
ing non-finite structures in adjunct position, as illustrated in (1) and (2): 


(1) [Sortant du cinéma], Emile a rencontré Léa 
Leav.PRS.PTCP the cinema, Emile has met Léa 


ro? 


“Leaving the cinema, Emile met Léa’ 


(2) [En sortant du cinéma], Emile a rencontré Léa 
En.PREP leav.PRS.PTCP the cinema, Emile has met Léa 


ro? 


“Leaving the cinema, Emile met Léa 


As its verbal base, the example in (1) contains a present participle (sortant), while 
the same verbal base in the gérondif (2) is obligatorily preceded by the preposition 
en (en sortant). The question then is how to handle the morpho-syntactic and se- 
mantic specificities of the gérondif compared to the present participle. Is the géron- 
dif simply a combination of the preposition en and the present participle, thereby 
yielding compositionality, or should en V-ant be treated as a single discontinuous 
morpheme in which en has lost all of its properties as a preposition? This question 
has received some attention in the literature over the past years (see Kleiber 2007), 
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though generally the bi-morphemic analysis is implicitly rejected assuming that 
the preposition en does not have an autonomous and independent interpretation. 
This means that it can no longer be seen as a P with a lexical content. According to 
standard assumptions, en should therefore be treated as a morphological ingredi- 
ent of the gérondif. 

In this paper, I will first present the arguments that are explicitly used by 
Kleiber (2007) for saying that the gérondif should be treated as a single, morpho- 
logical unit. On the basis of this presentation, I will provide some clarifications on 
the status of en, comparing it to prepositional complementizers with infinitives 
(de and a). Strong arguments for en’s status as a complementizer will be found by 
investigating the inner structure of the gérondif (and the present participle) on 
structural grounds. Binding, agreement and cliticization show the clausal proper- 
ties of both the gérondif and the present participle construction. It will therefore 
follow that en has become a prepositional complementizer, just as with de/a with 
infinitives. This conclusion enables us to revise some of the arguments in the lit- 
erature for the gérondif as a mono-morphemic structure. 


2. Ant-forms in French 


French has three participle constructions ending in -ant: the so-called ladjectif 
verbal in (3), the present participle in (4) and the gérondif in (5): 


(3) a. Un livre intéressant 
A book interesting.apj.M.sG 
“An interesting book” 


b. Des livres intéressants 
Books _interesting.ADJ.M.PL 
“Interesting books” 


c. Ce livre est intéressant 
This book is interesting.ADJ.M.sG 
“This book is interesting” 


(4) a. Sortant du cinéma, Emile a rencontré Léa 
Leav.PRS.PTCP the cinema, Emile has met Léa 


ro 


“Leaving the cinema, Emile met Léa 


b. Sortant du cinéma, Mariea  rencontré Léa 
Leav.PRS.PTCP the cinema, Marie has met Léa 


lo 


“Leaving the cinema, Marie met Léa 
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c. *Sortante du cinéma, Marie a rencontré Léa. 
Leav.PRS.PTCP.E.SG the cinema, Marie has met Léa 


> 


“Leaving the cinema, Marie met Léa 


(5) a. En sortant du cinéma, Emile a rencontré Léa 
En.PREP leav.PRS.PTCP the cinema, Emile has met Léa 


ro 


“Leaving the cinema, Emile met Léa 


b. En sortant du cinéma, Mariea rencontré Léa 
En.PREP leav.prs.pTcP the cinema, Marie has met Léa 


ro 


“Leaving the cinema, Marie met Léa 


c. *En sortante du cinéma, Marie a  rencontré Léa. 
En.PREP leav.PRS.PTCP.F.SG the cinema, Marie has met Léa 


lee) 


“Leaving the cinema, Marie met Léa 


The -ant-forms in (3) behave as true adjectives, so they will not be of primary 
concern here. These are true modifiers in nominal phrases (within the DP), agree- 
ing with the head noun in both gender and number (intéressant/intéressants) and 
appearing readily in predicational constructions after the copula verb étre. Rather, 
our main interest in this chapter will be the contrast between the present participle 
in (4) and the French gérondif (FG) in (5). These ant-forms are both verbal, accept- 
ing the same types of complements as the corresponding (non-derived) verbal 
forms (Il/Elle sort du cinéma), and they invariably exhibit no signs of agreement in 
nominal features. As non-finite forms, the ant-forms in (4) and (5) both have im- 
plicit subject arguments that are controlled by a nominal constituent, normally the 
subject (Emile or Marie) in the matrix clause. Whether the controller is masculine 
(Emile) or feminine (Marie), the ant-form stays the same (4)-(5). 

The phrases headed by the present participle in (4) and the FG in (5) function 
as adjuncts. Detached at the front of the clauses, the two constructions seem to be 
interchangeable without major semantic differences. In other contexts, however, 
there are no ways of using one in place of the other. From a standard viewpoint, 
only present participle phrases can be modifiers of noun phrases (6), whereas FGs 
are modifiers of verb phrases (7): 


(6) a. Un livre intéressant les éléves 
A book interest.prs.ptcp the pupils 
“A book which is interesting for the pupils” 
b. *Un livre en intéressant les éléves 


A book en.prep interest.prs.pTcP the pupils 
“A book which is interesting for the pupils” 
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(7) a. Il est parti | en courant 
He left en.PREP run.PRS.PTCP 
“He ran away” 


b. *Il est parti courant 
He left run.prs.pTCP 
“He ran away” 


Using the terminology of Haug, Fabricus-Hansen, Behrens and Helland 
(2012), I label the present participle in (6) adnominal, and the FG in (7) adverbal. 
In fact, in most kinds of contexts, there is no type of competition between the two 
forms at all. Besides the function as nominal or verbal modifiers, Halmoy 
(2003: 30-33) identifies at least two further types of usages in which the FG is ex- 
cluded in contrast to the present participle, notably as predicates of absolutes (8) 
or small clauses (9): 


(8) a. La nuit tombant,ils sont rentrés chez eux 
The night fall.prs.prcp, they returned home 
“At nightfall, they...” 
b. *La nuit en tombant, ils sont rentrés chez eux 
The night en.prep fall.prc.prcp, they returned home 
“At nightfall, they...” 


(9) a Il Ta surprise béchant dans son jardin 
He surprised her dig-prc.ptcpin her garden 
“He surprised her while she was digging in the garden” 
b. *Il Ta surprise en béchant dans son 
He surprised her en.PREP dig.pRS.PTCP in her 
jardin (on the intended reading) 
garden 


“He surprised her while she was digging in the garden” 


As with the corresponding present participle, the FG of (9) is starred if it is in- 
tended to mean that the object pronoun (or the subject of the small clause) is 
surprised when digging in the garden. With the FG, the clause can only mean that 
it is the subject pronoun (il) that digs in the garden. This pattern can of course be 
understood if the present participle is treated as an adnominal form associated 
with the pronominal object form, whereas the FG is adverb(i)al, taking the higher 
subject argument as its implicit subject. Hence, in most cases the present participle 
and FG have non-overlapping uses. Nevertheless, they basically have the same 
form. Let us now turn to the question of how this paradox has been resolved in the 
literature. 
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3. One or two forms? 


In order to define the specific character of the FG, there are two main positions in 
the literature. Either the FG is seen as a simple combination of the preposition en 
and the present participle or, alternatively, it is treated as a single discontinuous 
morpheme en _ ant, en having lost its properties as a preposition. Unfortunately, 
these questions have not really been addressed in the literature in an explicit man- 
ner. To the best of my knowledge, the only person to do so has been Georges 
Kleiber (2007), whose contribution will therefore constitute our starting point for 
the discussion in the coming sections. In short, in Kleiber’s terms, the two posi- 
tions above are labeled the bi-morphemic versus the mono-morphemic analysis. 


3.1 ‘The two proposals 


According to the bi-morphemic analysis, the FG does not exist as an autonomous 
unit at the morphological level. Instead, the preposition en combines composi- 
tionally with the present participle to yield the complex expression in (10): 


(10) [,en] + [present participle] 


This is not at all an original proposal, as it has been defended in the standard lit- 
erature by authors such as Grevisse (2008: 1152) and Wilmet (1998: 297), although 
it should be noted that the argumentation is generally not spelled out in detail. 
First of all, the bi-morphemic analysis is morphologically motivated, taking its 
position from the traditional dogma of one form-one meaning. But it is also based 
on syntactic criteria: The two forms are autonomous in the sense that they are used 
independently in other contexts. Thus, it can be easily demonstrated that both en 
and the present participle have independent uses, with en as the head of preposi- 
tional phrases (en + DP) and the present participle (as in (1) above) without the 
introduction marker en. The two morphological ingredients of the complex ex- 
pression can then be easily detached from each other, yielding the rather natural 
conclusion that with the FG, the two autonomous parts are simply combined. 

However, this position is countered by some highly evident morpho-syntactic 
considerations at the synchronic level. Consequently, while it is clear that etymo- 
logically speaking, en is to be considered as a preposition, this is not necessarily 
the case from a synchronic point of view. This is revealed most clearly by the con- 
trasts between Old French (11) and Modern French (12), as drawn by Halmoy 
(2003: 62): 


(11) En\a\de\ par \ por ... + present participle 
(12) En\*a\*de \ *par \ *pour ... + present participle 
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In Old French, en was just one preposition among many others - some of them are 
listed in (11) - to be used in front of the present participle. In contrast, Modern 
French only has the combination of en with the present participle, as all the other 
cases in (11) are deemed ungrammatical. The (modern) FG is therefore most com- 
monly defined as a proper form in the literature (see Halmgy (2003) and many 
others). In other words, there is no doubt about the etymological (and composi- 
tional) origin of the form (10)-(11), but synchronically, the two forms have lost 
their autonomy, thereby forming a single morphological unit. In fact, there is no 
way of identifying the morphological units of the FG as autonomous parts of a 
complex expression, which is the position Kleiber (2007) identified as the mono- 
morphemic analysis. 

According to Kleiber (2007), the bi-morphemic analysis implies the subordi- 
nating scheme in (13), in which en as a P applies to the verbal ant-form to yield a 
complex expression: 


(13) En (Vant) 


Adopting the mono-morphemic analysis, however, means that the FG should be seen 
as a discontinuous morpheme en-ant, which applies to the verbal base as in (14): 


(14) En_ant (V) 


This is a major point for Kleiber’s argumentation because it implies that the com- 
parison made between (15) and (16) below, even by proponents of the mono- 
morphemic analysis such as Bonnard (2001: 81), is now no longer tenable. In fact, 
(15) and (16), as adapted from Bonnard, force us to adopt the subordinating 
scheme in (13), although the former is not compatible with the discontinuous 
morpheme analysis in (14): 


(15) a voir, de parler, pour vendre, sans finir, ... *en voir. 
to see.INF, to speak.INg, in order to buy.1ng, without finish.1nr, ... en. PREP 
see.INF 


(16) *a voyant, *de parlant, *pour vendant, 
to see.PRS.PTCP, to speak.pRs.PTCP, in order to buy.PRS.PTCP, 
*sans finissant, ... en voyant. 
without finish.PRS.PTCP, ... Nn. PREP See.PRS-PTCP 


Whereas infinitives, as Kleiber puts it, are indeed instances of (13), this cannot be 
the case for the FG. The only way of comparing the infinitives in (15) with the FG 
in (16) would be to take the variable prepositions combined with the (variable) 
infinitival endings as discontinuous morphemes. This of course is not an alterna- 
tive. Various prepositions (4, de, de, pour, sans...) are combined with different types 
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of infinitival endings (-(e)r, -r, -re, etc.), which means that there is no formal link 
between the specific P and the infinitival morphology. Instead, if the mono-mor- 
phemic scheme in (14) is adopted, the FG has a unique morphological build-up. 

But before refusing en bloc, the comparisons between (15) and (16) allow us to 
take a closer look at the PPs in (15). What Kleiber intends to show is that these are 
all of type (13), with various prepositions freely taking infinitival complements. 
Yet this is not necessarily the case. It is well known, at least since Kayne’s seminal 
works from the 1970s and 1980s (see Kayne 1984), that de and a may behave as 
true complementizers. Using the Kaynian terminology, the infinitival comple- 
ments in (17) and (18) are introduced by prepositional complementizers: 


(17) Il promet de venir. 
He promises to.PREP.COMP come 


(18) Il apprend a nager. 
He learns to.PREP.COMP swim 


Hence, promettre and apprendre are to be treated as transitive verbs selecting in- 
finitival complements as direct objects. Various informal tests can be used in order 
to show this. When verbs such as promettre and apprendre take sentential objects, 
they are introduced by true complementizers such as que: 


(19) Il promet quwil viendra. 
He promises that.comp he will come 


(20) Il apprend quil viendra. 
He hears _ that.comp he will come 


Straightforward substitution patterns tend towards the same conclusion: 


(21) Il le promet. 
He it.cL promises 
“He promises it” 


(22) Il lapprend. 
He it.ci hears 
“He hears it” 


If the sentential objects in (17) and (18) were instead introduced by prepositions, 
we would expect pronominal patterns such as (23) and (24), which are clearly not 
available: 


(23) *Il en promet. 
He about it.cL promises 
“He promises about it” 
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(24) *Il y apprend. 
He to it.cL learns 
“He learns to it” 


More strikingly, when de and a indeed behave like prepositions, the substitution 
patterns are reversed: 


(25) Il se contente de partir. 
He himself contents to.PREP leave 
“He is happy to leave” 


(26) *Il se le  contente. 
He himself it.cL contents 
“He is happy it” 
(27) Il sen contente. 
He himself it.cL contents 
“He is happy with it” 


(28) Il invite les gens da venir. 
He invites the people to.PREP come 


(29) *Il le les invite. 
He it.cL them.c. invites 
“He invites them it” 
(30) Il les y invite 
He them.c1 to it.cL invites 
“He invites them to it” 


Moreover, when the same verbs are followed by DP-complements, de and a (31-32) 
are absent in contrast to the PP-complements in (33-34): 


(31) Il promet une réponse/*dune réponse. 
He promises an answer/ofan answer 


(32) Il apprendla natation/*d la natation. 
He learns the swimming 
“He learns to swim’/to the swimming 


(33) Il se contente de partir/d’une réponse. 
He himself contents to.pREP leave/with.PREP.an answer 
“He is happy to leave/with an answer” 


(34) Il les invite a venir/a une réception. 
He invites them to.PpREP come/He invites them to.PREP a reception 
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This shows that de and d in (17)-(18) are complementizers and not prepositions 
in the sense of (13) and (14), whereas the same forms act as prepositions in (25) 
and (28). 


(35) Il promet [,» de venir]. 
He promises to.PREP.COMP come 

(36) Il apprend [,, a nager] 
He learns to.PREP.COMP swim 


(37) Il se contente [,, de partir]. 
He contents himself to.PREP leave 
“He is happy to leave” 


(38) Il invite les gens [,,a venir] 
He invites the people to.PREP come 


The second main point for Kleiber in order to refuse the comparisons between 
(15) and (16) emerges due to similarities between the FGs in (39) and the PPs with 
nominal complements in (40). According to Kleiber, this enables new empirical 
discoveries of the FG, particularly for the comitative meanings and the meaning 
equivalencies with PPs introduced by avec, sans, etc. and the FG: 


(39) Pierre boit avec/sans modération. 
Pierre drinks with/without moderation 


(40) Pierre se rase en chantant/sans chanter. 
Pierre shaves himself en.PREP sing.PRS.PTCP 
“Pierre shaves himself while singing”/without sing-1nF “without singing” 


Both avec-complements and the gérondif then have as their antonymous counter- 
parts PPs, as introduced by sans. However, the basic assumption underlying this 
line of argumentation is that the PP complement of sans and avec, even for the 
infinitive and like the FG in (40), basically has a nominal character and should 
possess nominal features. 

But this cannot be taken as a general point. Building on classical argumenta- 
tion from the Principles and Parameters theory, it could be shown that neither the 
infinitival complement nor the FG in (40) are nominal constituents. Instead they 
should be treated as clauses. Take as a starting point the complex sentence in (41): 


(41) Il promet de dire au revoira Jlenfant. 
He promises to.PREP.COMP say goodbye to __ the child 


As we have seen above, promettre takes a sentential direct object introduced by the 
prepositional complementizer de: 
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(42) Il le/*en — promet. 
He it.cL/it.cL promises 
“He promises it/of it” 


With the non-finite verb dire of the embedded clause being a three argument-verb 
(quelqu’un dit quelque chose a quelqu’un), the pronominal D-subject of the upper 
clause apparently fills two functions in the complex sentence: first as the subject of 
promettre, then as the implicit subject of the verb dire. Since the 1970s, this phe- 
nomenon has been explained by the notion control of an implicit subject PRO in 
traditional terms. Even if dire in (41) does not possess an explicit subject, it does in 
fact assign a thematic role to its first (and external) argument, which must be 
structurally present. This assumption has not been altered with the movement 
theory of control from later years (see Boeckx, Hornstein and Nunes 2010), in 
which the lower subject is allowed to move from its base position, thereby leaving 
a copy on its way up to its final position as the subject of promettre. The moved 
constituent and its copy are therefore necessarily co-referential. 

If we compare the infinitival complement in (41) with the FG in (43), we ob- 
serve a direct parallelism: 


(43) Charles pleura en lui disant adieu. 
Charles cried en.pREP him.cL say.pRs.PTCP goodbye 
“Charles cried while saying goodbye to him” 


Whether the implicit subject of the non-finite adjunct in (43) should be repre- 
sented by a traditional PRO, as in standard Control Theory, or be allowed to move 
out in the Movement Theory of Control (MTC), there is obligatorily a co-reference 
between the upper subject and its lower copy or controlee. It follows that both the 
infinitive in (41) and the FG in (43) possess a structural subject, leaving no doubt 
about their clausal nature. Further arguments can also be put forward in order to 
show this (see also Legendre 1997). 

It is well known that a reflexive pronoun (or an anaphor) that obeys principle 
A of the Binding Theory must be bound by a local antecedent. 


(44) Il, S;.jest lavé. 
He himself:REFL.CL washes 
“He washes himself” 


‘The reflexive pronoun must have the same reference as its local subject, otherwise 
the example is ungrammatical. Next, compare (44) with (45) below: 


(45) Il a_ glissé [en @; Sei lavant] 
He has slipped en.pRep himself:REFL.CL wash.PRS.PTCP 


“He slipped while washing himself” 
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Just like in its finite counterpart, the reflexive is bound by its local subject, being 
represented for the time being by e. Without the presence of this local subject, no 
element in the local domain of the anaphor would be able to bind it. We find ex- 
actly the same pattern with the infinitive: 


(46) Il veut [e, SE laver]. 
He wants to himself.REFL.CL wash.INF 
“He wants to wash himself” 


Again, the local character of the relationship between the anaphor and its subject 
explains the mechanisms in (46). The bracketed constituents in (45)-(46) are 
therefore clausal. This type of argumentation is strengthened by a second type of 
facts, this time considering the local relation of the agreement. Past participles in 
(written) French agree with its (small clause) subject, as is shown with the passive 
participles in (47): 
(47) a Il a été payé 
He has been paid.pst.PTCP.M.SG 
“He has been paid” 
b. Ellea été  payée 
She has been paid.pst.PTCP.E.SG 
“She has been paid” 


This pattern generalizes to both the FG in (48) and the infinitive in (49): 


(48) a. Il, a parcouru le monde [tout en i 
He has traveled around the world en.PREP be.PRS.PTCP 
étant payé] 


paid.psT.PTCP.M.SG 
“He traveled around the world while being paid” 


b. Elle, a parcouru le monde [tout en e, 
She has traveled around the world en.PREP be.PRS.PTCP 
étant payée]. 


paid.psT.PTCP.E.SG 

“She traveled around the world while being paid” 
(49) a. Il, veut[e; étre payé] 

He wants be.INF paid.psT.PTCP.M.SG 

“He wants to be paid” 

b. Elle; veut [e, étre  payée] 
She wants be.inF paid.psT.PTCP.RSG 
“She wants to be paid” 
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By postulating the existence of an implicit subject (e) within the non-finite brack- 
eted clause, the local agreement relationship between the participle and its subject 
is maintained. 

Further arguments in favor of the clausal nature of the non-finite adjunct are 
given by the ordering of clitic pronouns. In standard French, pronominal clitics 
are attached to the leftmost verb of the clause. With a finite verb, whether it is a full 
verb or an auxiliary verb, this pattern is shown in (50): 


(50) a. Il le lui dit 
He.ct it.cy him.ct/her.ci says 
“He says it to him/her” 


b. Il le lui a dit 
He.ct it.ct him.ct/her.ct has said 
“He said it to him/her” 


Next, compare the clitic placement in the main clauses of (50) with the infinitives 
and the FGs in (51)-(52): 


(51) a. Il souhaite [le lui dire] 
He wishes it.cL him.ct/her.cL say.INF 
“He wishes to say it to him/her” 


b. Il a souhaité [le lui dire]. 
He has wished it.cLh him.ct/her.c. say.1NF 
“He wished to say it to him/her” 


(52) a. Il pleure [en le lui disant] 
He cries en.PREP it.cL him.ct/her.cL say.PRs.PTCP 
“He cries while saying it to him/her” 


b. Il a_ pleuré [en le lui disant]. 
He has cried en.PREP it.cL him.ci/her.cL say.PRS.PTCP 
“He cried while saying it to him/her” 


In the complex sentences of (51) and (52), the leftmost verb to which the clitic 
pronouns attach is the non-finite verb form (the infinitive dire or the FG disant) 
and, of course, not the main clause verb forms (souhaite/a souhaité or pleure/a 
pleuré): *Il le lui souhaite dire./*II le lui pleure en disant. These contrasts are readily 
explained if the non-finite constructions in (51) and (52) are treated as clauses. If 
we insert a (structurally present) implicit subject (e) in (51)-(52), the infinitive 
and FG then form a natural clause boundary, with the clitics being attached to the 
leftmost verb form of the lower clause: 
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(53) a. Il souhaite [e le lui dire] 
He wishes it.cL him.ct/her.cL say.INF 
“He wishes to say it to him/her” 


b. Il pleure [en ele lui disant]. 
He cries en.PREP it.cL him.ct/her.c say.pRs.PTCP 
“He cries while saying it to him/her” 


The arguments above then show that just like with infinitival complements, the FG 
not only has verbal features in itself, the construction as a whole in fact has clausal 
properties. Both infinitives and FGs are domains for binding, agreement and cliti- 
cization, which also generalizes to present participles. For the following, we there- 
fore make the subsequent assumptions, which contradict the proposal by Kleiber 
(2007). 


i. In general, the comparison between the PPs with infinitival complements in 
(15) and the FG in (16) are not illicit, as claimed in Kleiber’s paper. The Ps in 
(15) may act as true prepositions, but they can also introduce both infinitives 
and FGs as (prepositional) complementizers. 

ii. As (prepositional) complementizers, de, a and en take sentential comple- 
ments. 


These assumptions presuppose a recategorization process of en from P to C in the 
FG, with the P initially being the head ofa PP, though turning in to the head of the 
CP (P > C)!: 


(54) [pp [p [cp [c enl yp € parlant]]]]] 


We will now concentrate on the internal build-up of the FG and the present parti- 
ciple both of which reflect the properties in (i) and (ii), as well as the formal link 
between the two forms in -ant. 


4. The inner structure of the participle constructions 


First, compare the FG and the present participle in (55-56) with the past participle 
constructions in (57-58): 


(55) En Yapercevant trop tard, il... 
En.PREP.COMP him.ct/her.cL/it.cL notic.PRS.PTCP too late, he... 
“Noticing him (etc.) too late, he... 


1. See Rafel (2000) for discussions of the reverse type of recategorization process: C > P. 


2. The data in this section is mainly taken from Helland and Pitz (2012). 
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(56) Lapercevant trop tard, il... 
Him.ct/her.ct/it.cL notic.PRS-PTCP too late, he... 
“Noticing him (etc.) too late, he... 


(57) ‘Terrorisés, les informateurs se sont enfouis. 
Terrifi.psT.PTCP.M.PL the informers ran away 
“Terrified, the informers ran away” 


(58) *Les terrorisés, ils se sont enfouis. 
Them.ci terrifi.pst.PTCP.M.PL, they ran away 


Contrary to past participles (58), both the FG (55) and the present participle (56) 
admit object clitics. Even more strikingly, the latter two have a ne-pas-negation in 
contrast to the past participle: 


(59) En se plaignant de TIaffaire, il... 
En.PREP.COMP complain.prs.prcp about the deal, he... 
“While complaining about the deal, he...” 


(60) En ne se plaignant pas de Taffaire, il... 
En.PREP.COMP not complain.prs.ptcp about the deal, he... 
“Not complaining about it..” 


(61) Partant avec ses collégues, il... 
Leav.PRS.PTCP with his colleagues, he... 
“Leaving with his colleagues, he..” 


(62) Ne partantpas avec ses collégues, il... 
Not leav.prs.pTcpP with his colleagues, he... 
“Not leaving with his colleagues, he..” 


(63) Venus dAsie centrale, ils... 
Com.PST.PTCP.M.PL from Central Asia, _ they... 
“Coming from...” 


(64) *Ne venus pas d Asie centrale, ils... 
Not com.Pst.PTCP.M.PL from Central Asia, they... 
“Not coming from...” 


Contrary to the past participle, the cliticization pattern in (55-58) and the negation 
pattern in (59-64) point to the conclusion that the FG and the present participle 
project to the sentential level. Below, we are going to claim that this means that 
there is a (defective) TP-projection above the the vP-level for both (verbal) present 
participles and FGs. Furthermore, FGs and present participles admit all kinds of 
verbs, whether they are active, passive, unergative or unaccusative as in (65), 
whereas past participles are restricted only to passives and inaccusatives (66): 
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(65) a. (En) renégociant le contrat, il... 
(En.PREP.COMP) renegotiate.PRS.PTCP the contract, he... 
“Renegotiating the contract, he..” 


b. (En) étant renégocié, le contrat 
(En.PREP.COMP) be.PRs.PTCP renegotiate.psT.PTCP, the contract 
entre en  vigueur... 
comes into effect 
“Being renegotiated, the contract comes into effect” 


c. (En) arrivant a la gare, il... 
(En.PREP.COMP) arriv.PRS.PTCP at the railway station, he... 
“Arriving at the railway station, he...” 


d. (En) dansant avec sa femme, il... 
(En.PREP.COMP) danc.PRS.PTCP with his wife, he... 
“Dancing with his wife, he..” 


(66) a. *Renégocié le contrat, il... 
Renogiat.pst.ptcp the contract, he... 
“Renegotiating the contract, he...” 


b. Renégocié, le contrat entre en vigueur 
Renegotiat.psT.PTCP.PAss, the contract comes into effect 
“Renegotiated, the contract...” 


c. Arrivé a la gare, il... 
Arriv.psT.pTCP at the railway station, he... 
“Arriving at the railway station, he...” 


d. *Dansé avec sa femme, il... 
Danc.pst.PTcP with his wife, he... 
“Dancing with his wife, he...” 


These patterns (65-66) imply that FGs and present participles are less defective than 
past participles in French. Taking this into consideration, we are led to the hypothesis 
that transitive v-heads, such as terminant in (en) terminant son travail, have proper- 
ties that distinguish them from past participle heads in *terminé son travail. 

We concentrate on the representation of FG and present participles. For the 
time being, we represent the implicit subject form by (little) pro. Pro is co-indexed 
for the spec-position of vP and TP, and is forced to move to the specifier of TP for 
EPP reasons. The TP must hence be defective in the sense of Chomsky (2001) and 
DAlessandro and Roberts (2008) because it does not assign a structural case to its 
specifier. Nonetheless, it respects Burzio'’s generalization by having an externally 
implicit (vP-specifier) argument, thus allowing an accusative case to be assigned 
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to the internal argument of V (le travail). I follow traditional considerations 
(see Helland and Pitz 2012) by letting the internal argument of V occupy the spec- 
position of VP. As with this latter treatment, we attach the participial morphology 
to the v-head and have the verbal stem move from the lower V-projection, picking 
up its participial morphology on its way up to the (defective) T-head position. 
What would distinguish the FG and the present participle in (67) is the presence of 
a CP-level for the former, in which the prepositional complementizer en takes as its 
complement the (defective) TP headed by the present participle: 


(67) CP 


Cc’ 


ae 
oN 


BE os. 
termin,tant vP 
a 
Vv VP 
termintant 
DP Vv’ 
son travail | 
V. 
ternuin- 


For the sake of convenience, we have represented the spec-position of VP and TP 
in (67) by (little) pro. However, as we have seen above, in traditional Control The- 
ory (Chomsky 1981: 74-85), the implicit argument of the non-finite clause was 
represented by (big) PRO. The existence of a PRO subject was theoretically justi- 
fied by the theta criterion, which enforced the implicit argument to have its own 
thematic role independent of the thematic role of the controller. In a structure like 
(2), repeated in (68), this means that the external argument for the upper clause 
(Emile) would have its own thematic (agentive) role as a controller of PRO in the 
embedded non-finite clause: 


(68) [En PRO, sortant du cinema], Emile;a rencontré Léa. 
En.PREP.COMP leav.PRS.PTCP the cinema, Emile has met Léa 
“Leaving the cinema, Emile met Léa” 


However, in minimalist accounts since Hornstein (1999), the existence of PRO 
has been questioned and even abandoned, thus redefining control as movement. 
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This position necessarily implies a redefinition of the theta criterion, saying that 
that there are no upper limits on the number of theta roles a DP argument may 
have. Interestingly, the movement account is also applied to adjunct control in 
cases like (69): 


(69) John heard Mary [without [fehn entering the room]] 


This is achieved through a so-called sideward movement, with John leaving a copy 
within the adjunct. In fact, in MTC, whenever movement is theoretically an op- 
tion, we find an obligatory control that makes the standard control cases in (17) 
and (18) above resemble the adjunct control in (69). If we were to treat (68) in 
parallel fashion, this would mean that the subject part of the FG may be allowed to 
move out from the adjunct through a sideward movement. Moreover, the FG in 
(68) is interchangeable with the present participle, with minimal meaning differ- 
ences. This should mean that (68) may have the structure in (70): 


(70) [En Emile sortant du cinema], Emilea rencontré Léa. 
En.PREP.COMP leav.PRS.PTCP the cinema, Emile has met Léa 
“Leaving the cinema, Emile met Léa” 


However, in other cases the movement option is clearly not available. In MTC, this 
is illustrated through the impossibility of having the implicit subject argument 
move out of a gerund in the subject position. 


(71) *John said that [[Jehrn washing himself] delighted Mary. 


Such island constructions are treated as elsewhere cases in MTC (see Boeckx, 
Hornstein and Nunes 2010). There are also cases for the FG in which the move- 
ment option does not seem to be available: 


(72) En arrivant a 
En.PREP.COMP alriv.PRS.PTCP in 
Perrache, il faut rejoinder la gare routiére a pied. 
Perrache., ... 
“Arriving in Perrache, it is necessary to go to the bus station by foot” 
(73) Liste a fournir en arrivant a la douane. 
List to supply en.PREP.COMP arviv.PRS.PTCE... 
“List to supply when arriving at the custom” 
(74) En attendant Godot (book title). 
En.PREP.COMP wait.pRSs.PTCP for Godot 
“Waiting for Godot” 
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In (74), there is not even a thematic position to which the implicit subject of the 
non-finite form may move. And in (72-73), it is not clear how the (arbitrary) con- 
troller of the infinitive could be realized by movement. If these are elsewhere cases, 
we should therefore expect a pro-subject for the FG. This would mean that for the 
FG, both a subject copy (70) and pro-subject (72-74) should be available. Note 
that in cases like (72-74), in which the movement option is difficult to motivate, an 
alternation between the FG and a present participle does not seem to be available: 


(75) *Arrivant a Perrache, il faut rejoindre... 
Arriv.PRS.PTCP in Perrache, ... 
“Arriving in Perrache, it is necessary to go to the bus station by foot” 


(76) “Liste 4 fournir arrivant ala douane. 
List to supply arriv.PRs.PTCP... 
“List to supply when arriving at the custom” 


(77) Attendant Godot. 
Wait.pRs.PTCP for Godot 
“Waiting for Godot” 


In fact, the present participle seems to require an explicit controller. This con- 
straint is also compatible with cases like (9) above, in which only the present par- 
ticiple can be used. These latter instances would probably also lend themselves to 
a movement account. For this reason, the control properties are not the same for 
the two constructions. The present participles are restricted to obligatory control, 
while this is not the case for the FG. 


5. Conclusions 


In this chapter, I have given an account of the morpho-syntactic build-up of the 
FG, and compared it to the present participle. Maintaining the compositional na- 
ture of the construction, I have argued for a recategorization process of the 
preposition en which introduces the FG, thus turning it into a prepositional com- 
plementizer. This is easily motivated by comparing it to prepositional complemen- 
tizers with infinitives and distinguishing it from Ps that are, on the synchronic 
level, freely combined with infinitives. We have also seen that both the FG and the 
present participle project up to the clausal level and are not nominal constituents, 
such as stated in Kleiber (2007). Clitic placement, agreement and negation pattern 
all present evidence in favor of this conclusion. By projecting a defective TP, the 
two ant-forms are then both clausal in nature. Even so, they differ in their control 
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properties, with the FG allowing both obligatory and non-obligatory control, 
while the present participles are restricted to obligatory control. 
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Topics and the left periphery 


A comparison of Old French and Modern Germanic 


Christine Meklenborg Salvesen 
University of Oslo 


Just as the Modern Germanic languages (with the exception of English) have 

a verb second structure (V2), Old French was a V2 language. In this paper I 
compare Old French and Modern Germanic and show that they share the basic 
configuration. I suggest that these languages all have obligatory verb movement 
to Fin®, and that only material in SpecFocP and SpecFinP are visible for V2. All 
the V2-languages investigated exhibit instances of apparent V3-constructions 
that may be accounted for in a split-CP-framework. Crucially, there is a 
difference between Modern Germanic and Old French regarding topics. In 
Modern Germanic, a left dislocated topic must be immediately followed by its 
resumptive pronoun, a constraint which is not operative in Old French. 


1. Introduction 


Cartographic research has led to a more profound understanding of the fine 
grained structure of the different parts of the derivation.! Prior to Pollock (1989), 
the derivational structure was basically VP-IP-CP. Subsequently this structure has 
been refined into an overwhelming number of cartographic positions, even though 
the basic conception of the three major fields - Verb Phrase (VP or vP), Inflec- 
tional Phrase (IP) and CP (Complementiser Phrase) have subsisted. In this work I 
will investigate the subfields of the CP in Modern Germanic (with the exception of 
English) and Old French. Although Old French is a Romance language and as 
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such not related to the Germanic languages, all the languages show a number of 
similarities. Some of these may be caused by the fact that the language spoken in 
Gaule was substantially influenced by Frankish from the 6th century. 

Just as the Modern Germanic languages in question, Old French was a verb 
second (V2) language. It has, however, been suggested that Old French has more 
occurrences of V3 structures than the Modern Germanic languages. This fact that 
hasled researchers to claim that Old French was nota true V2 language (Kaiser 2002; 
Rinke and Meisel 2009). Others attribute its structure to a lower landing site for 
the finite verb so that more positions become available in the hierarchy above the 
finite verb (Poletto 2002; Walkden in press). 

I will argue that Old French and Modern Germanic V2 languages have acti- 
vated the same positions in the left periphery and that the finite verb moves to the 
same head. Considering the V3 word orders in Old French, we will see that the 
majority of these clauses pattern with word order in Older Germanic V2 lan- 
guages rather than the Modern Germanic ones. When it comes to topicalised 
structures, I will show that the use of these is more restricted in Modern Germanic 
than in Old French. This means that whereas Old French shares syntactic proper- 
ties with the Germanic languages, it has the pragmatic properties of the Romance 
languages. 

In the following I will first define V2 in Section 2. Then I will show how Old 
French patterns with modern V2 languages and examine its left periphery in 
Section 3. Having done so, I will turn to the Modern Germanic V2 languages Ice- 
landic, Norwegian and German in Section 4. Finally, I will discuss the pragmatic 
and syntactic properties of left dislocated topics and hanging topics in Section 5. 

The examples in this article are in Old French unless otherwise indicated. The 
references should be read as follows: the first number indicates the century in 
which the text was written (e.g. 13th = 13th century). Then follows the abbreviated 
name of the text in question. A list of quoted texts can be found as an appendix. 
The final number indicates either the verse (texts in verse), the chapter (texts in 
prose) or the page (p). 

Examples in Norwegian are from the Eastern dialects in the Oslo-Romerike 
area. 


2. V2and the split CP 


The notion of V2 means that the finite verb occurs in the second position. Holmberg 
(2011) writes that “[a] language is called a verb-second (V2) language when the 
finite verb is obligatorily the second constituent, either specifically in main clauses 
or in all finite clauses.” 
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This phenomenon characterises most Germanic languages, such as Icelandic 
and Faroese (Insular Scandinavian), Danish, Swedish, Norwegian (Mainland 
Scandinavian), German, Dutch, Afrikaans and Yiddish. In fact, Modern English is 
the only Modern Germanic language that does not have V2 (Holmberg 2011). The 
status of Old and Middle English is less clear, and researchers have suggested that 
there were several competing grammars in the old language (Bech 2001; 
Westergaard 2009; van Kemenade and Westergaard 2012). V2 is also attested in 
Breton, in Estonian, Sorbian, Kashmiri and in two dialects of Himachali (Holmberg 
2011). A number of Old Romance languages have been claimed to be V2 as well 
(Salvi 2004), such as Old Portuguese (see Eide (2006) for a discussion), Old Italian 
(Poletto 2006), Old Neapolitan (Ledgeway 2008). One modern Romance language 
displays V2, namely Rhaetoromance (Poletto 2002). 

The notion V2 simply describes the position of the finite verb in declarative 
main clauses. A language such as English is an SVO language where an XP may 
precede the subject without this affecting the internal order of the subject and the 
verb as in (1). In a V2 language, however, a fronted XP yields the order VS. This 
means that the canonical V2 language will present either the order S-V-O (2-a) or 
XP-V-S-O (2-b). 


(1) a. Jane met aman. 
b. Yesterday, Jane met a man. (English) 


(2) a. Janne motte en mann. 
Jane met a man 


“Jane met a man.” (Norwegian) 
b. I gar motte Janne en mann. 

in yesterday met Jane a man 

“Yesterday, Jane met a man.” (Norwegian) 


The V2 phenomenon has been analysed as the result of verb movement to C° 
(see among others den Besten (1983); Holmberg and Platzack (1995); Schwartz 
and Vikner (1996)). The element preceding the finite verb will be in SpecCP, both 
when it is a subject (3) and when it is a non-subject XP (4). The subject following 
the finite verb will be in SpecTP (4). 

(3) [cp Janne [c motte ... [1p [vp en mann]]]] 

(4) [cp I gar [c motte [1p Janne [vp en mann]]]] 


Ever since Rizzi (1997), the picture sketched out above has been somewhat blurred. 
Rizzi proposes that the CP field consists of several projections in a fixed order (5). 


(5) ForceP ... TopP ... FocP ... (TopP) ... FinP. 
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Force encodes the clause type, whereas Fin relates to finiteness. Fin either selects a 
+Finite complement or a —Finite one. Between the two projections, there is a Focus 
projection and recursive Topic positions. Beninca and Poletto (2004) have elabo- 
rated on this analysis by proposing that Focus and Topic may be conceived of as 
fields, each containing a number of projections ordered according to a strict hier- 
archy. In the clause, the topics always precede the foci. They propose that there are 
two different kinds of topics, namely the ones setting the frame for the clause, and 
the ones constituting its theme. The higher projection, the Frame, is situated to the 
left of the ForceP and “contains more salient information from an informational 
point of view (...) information about the main Topic and the “where and when” of 
the sentence” (Beninca and Poletto 2004: 71). The Theme is situated to the right of 
ForceP and contain less salient elements. Beninca and Poletto suggest that the 
makeup of the left periphery displays a “progression toward new information” 


(6) Frame... Force... Theme ... Focus ... Fin 


The three fields have different structural properties. Elements in the Focus field 
have moved there, whereas elements in the Topic field (Frame and Theme) have 
been merged there (Cinque 1990; Beninca and Poletto 2004). The differences be- 
tween the different types of topic will be discussed in Section 3.5. 

It is important to stress that the labels here refer to syntactic positions rather 
than strict pragmatic functions (Beninca 2006). Further, I will assume that the 
hierarchy of positions is universally accessible, but that languages vary in respect 
to what positions they use (Cinque and Rizzi 2008). 

When the CP has been split it is less obvious where the verb lands in a V2 
language. Holmberg (2011) defines V2 in the following way (7), thus avoiding the 
issue of where exactly the verb moves to. 


(7) ‘The V2 property is made up of the two components: 
a. A functional head in the left periphery attracts the finite verb. 
b. This functional head wants a constituent moved to its specifier 
position. 


In the following I will refer to Holmberg’s definition as the V2 requirement. 

In principle all heads in the CP field are accessible to the finite verb. The ques- 
tion is to determine exactly where it ends up. Roberts (2004) has proposed that in 
all V2 languages, the finite verb has to check a feature in Fin®. Only one element 
may precede it, moving through its specifier, which then serves as a bottleneck. 
Salvi (2004) avoids the split-CP model and proposes the finite verb moves to a 
head above IP, which he calls T/F°. The constituent occupying its specifier position 
has the pragmatic function of a theme or a focus. In addition, Salvi proposes that 
there is a projection above T/FP, namely T”P, which represents the periphery of 
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the clause. This projection is not involved in the V2 structure. Poletto (2002), on 
the other hand, proposes that some V2 languages may have a strong Force® feature 
that needs to be checked as well. This way the finite verb may be in either Fin® or 
Force®. In order to determine where the finite verb sits in a V2 language, it is nec- 
essary to take a closer look at the left periphery. I will first do a rather thorough 
investigation of Old French in Section 3 before turning to Norwegian, Icelandic 
and German in Section 4. 


3. ‘The left periphery of Old French 


Old French is the language spoken in Gaul in the Middle Ages. The first written 
document in vernacular French dates to 842 (the Oaths of Strasbourg). Old French 
is a Romance language, but the language shows several Germanic properties 
(Mathieu 2007) that are attributed to the conquest of Gaule by the Franks in the 
5th century (see Rey et al. (2011) for a detailed presentation). 

It is common to distinguish between Old and Middle French. Old French is 
characterised by a binary Case system which distinguishes between a subject case 
and an oblique case. The transition between the two periods is defined by the loss 
of this binary case system, which took place towards the end of the 13th century. 
The period of Middle French period stretches from around 1300 to 1600, when the 
Classical period starts. This article primarily examines Old French. 


31 <A V2 language 


The fact that the verb tends to come in second place in Old French was first point- 
ed out by Thurneysen (1892) and it is common to analyse Old French as a V2 
language (Foulet (1930); Adams (1987, 1989); Roberts (1993); Vance (1989, 1997); 
Buridant (2000); Cardinaletti and Roberts (2002); Salvi (2004); Mathieu (2006, 
2007); Labelle (2007), but pace Rinke and Meisel (2009) and Kaiser (2002)). If the 
subject is in the first position of the clause, as in (8-a), the verb follows the subject, 
just as in Modern French - or Modern English, for that matter. However, if a con- 
stituent other than the subject is in the first position of the clause, the subject 
follows the finite verb. In (8-b) the DP cele nuit is in first position, and the subject 
is prevented from preceding the verb (fu). The subject (Galaaz) appears in third 
position of the clause. 


(8) a. Galaaz regarde le vallet qui si durement plore 
Galahad watches the valet who so hard cries 
“Galahad looks at the valet who cries so hard.” (13th, Queste 39) 
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b. Cele nuit fu  Galaaz serviz au melz que li frere 
this night was Galahad served on.the best that the brothers 
porent 
could 


“This night, Galahad was served in the best way the brothers knew.” 
(13th, Queste 45) 


3.1.1 Pro and overt subjects 

There is a correlation between a fronted XP and the use of pro, as Foulet (1930) 
was the first to observe. In (9) the adverb lors ‘ther’ is in the first position of the 
clause, the finite verb comes in second place, and no subject is expressed. 


(9) Lors voient devant els au travers du chemin une 
then saw before them on.the through ofthe road a 
abeie de nonains 
monastery of nuns 
“Then they saw in front of them, on the other side of the road, a convent.” 
(13th, Queste 2) 


Based on a study of two large excerpts of la Queste du Saint Graal, a 13th century 
prose text, Vance (1997: 38) provides a detailed count of the different word orders. 
In main clauses where an XP precedes the verb, 33% of the subjects are omitted. 
Excluding nominal subjects, the rate is as high as 76.3%. 

The fact that pro only occurs in contexts where V2 is licensed has been inter- 
preted as the consequence of verb movement to the C-domain. Only when the 
verb is under C° may it license the omission of the pronominal subject (Vanelli 
et al., 1985; Roberts 1993; Beninca 2006). 

The traditional analysis proposes that pro and a pronominal subject appear in 
the same position of the clause. Vance (1997) has pointed out that a postverbal 
pronominal subject is always adjacent to the verb (10). 


(10) Sire, fait il, auvec vos n irai je pas. 
sir, did he, with you NEG go.FUTI NEG 
“Sir, he said, I will not go with you.” (13th, Queste 4) 


Whereas the inverted pronominal subject immediately follows the finite verb, 
there are two different positions for the full DP suject: either immediately follow- 
ing the finite verb or in its base generated position in vP. This is clear when the verb 
is in a compound tense, as in (11). In (11-a) the subject (in boldface) is placed 
between the auxiliary and the participle, whereas it follows the participle in (11-b). 
The former type of inversion has been coined Germanic inversion, the latter 
Romance inversion (Adams 1987; Prévost 2011). 


Topics and the left periphery 137 


(11) a. Un jor s estoit li rois levez et fu venuzen son palés.. 
one day REFL was the king risen and was come in his palace 
“One day the king had woken up and gone to his palace ... “ 
(13th c., Trist-prose, 438) 


b. Tout einsint ont anonciee li hermiteet li saint 
all this-way have announced the hermits and the holy 
home vostre venue plus a de vint nz 
men your arrival more has of twenty years 
“This way the hermits and the holy men have announced your arrival 
more than twenty years ago.” (13th, Queste 44, cit. Prévost (2011)) 


The name used by Prévost clearly indicates the difference between the two types of 
inversion: The Germanic type is the one found in Germanic V2 languages, and the 
Romance one resembles the one that is used in Modern Romance languages. The 
similarities between present day Romance final inversion and the Old French kind 
are, however, superficial, as pointed out by Vance (1997). She discusses the so 
called free inversion in Italian at length and compares it to the cases where the Old 
French subject appears after the participle, e.g. not immediately after the finite 
verb. She shows that these two inversions differ in many aspects. In Old French, 
the subject may be followed immediately by a direct object, an option which seems 
more restricted in Italian. The subject in Old French primarily carries old informa- 
tion, whereas they are prone to carry contrastive focus in Italian. Finally, in Italian 
the subject may always precede the finite verb in these constructions. In Old 
French that is not possible. Vance concludes that this inversion “should be consid- 
ered as a part of the V2 grammar of Old French” (Vance 1997: 92). 

‘Thus it is important to maintain that in Old French inversion is the rule (12-a), 
just as it is in a Modern Germanic language, such as German (12-b). In German 
the word order in (12-c) is ungrammatical. 


(12) a. Or dit li contes que grant tens demora li rois 
now says the story that big time rested the king 
Melyadus sanz _— feme 
Meliaseus without wife 
“Now the story tells that for a long while the king Meliadus lived with- 
out a wife” (13th c., Trist-prose, 244) 


b. Jetzt lesen die Kinder ein Buch. 
now read the children a book 
“Now the children are reading a book” (German) 


c. *Jetzt die Kinder lesen ein Buch. 
now the children read a book 
Now the children are reading a book” (German) 
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In Modern Romance, inversion is however optional. In a language such as Spanish, 
a fronted adverb may trigger subject inversion (13-a) or it may not (13-b) 
(all examples from Ruiz Rufino (2010: 204)). The unmarked word order with tran- 
sitive verbs is S-V-O. If the constituents appear in a different order, the word order 
is marked for informational structural purposes (Ruiz Rufino 2010). 


(13) a. Ayer vio Andrea la _ pelicula. 
Yesterday saw Andrea the film 
“Yesterday Andrea saw the film: (Spanish) 


b. Ayer, Andrea vio la _ pelicula 
Yesterday Andrea saw the film 
“Yesterday Andrea saw the film.” (Spanish) 


3.2 Several elements in front of the finite verb 


Even though Old French patterns with modern V2 languages in the cases cited 
above, it is a fact that there is a considerable number of cases where more than one 
constituent precedes the finite verb. In Rinke and Meisel’s corpus study, 5.1% of 
the examples have two elements preceding the finite verb (Rinke and Meisel 2009); 
16.5% of declarative matrix clauses with explicit subject have a superficial V3 
structure in Kaiser (2002). In Radwan (2011), as many as 19.2% of the investigated 
main clauses have more than one element preceding the finite verb. Given the 
conception of a V2 language as a language where “the finite verb is obligatorily 
the second constituent” (Holmberg 2011), these proportions are quite surprising. 
The question of course is what causes these unexpected structures. I will return to 
the V3 word orders in Section 3.6.1. 

I will start by examining the interrogative structures before turning to the de- 
clarative clauses. At first I will consider the wh word, then I will look at the finite 
verb and finally the different kind of topics. 


3.3. FocusP and the position of the wh-word 


In Old French direct questions are formed by wh-movement, and in accordance 
with the wh-criterion (Rizzi 1996) the finite verb is always adjacent to the wh-word 
(14). (In (14-a) m’en ‘me of-it’ are clitics upon the finite verb porte ‘carry:) 


(14) Ou wm en porte on? Que devenrai?» 
where me of-it carry.PRS one what become.FUT 
“Where are they taking me? What will I become?”(12th c., Trubert, 2335) 
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According to Rizzi (2001) the wh-word itself carries a focus feature in direct inter- 
rogative clauses. Thus it is not possible for a wh-word and a focus to co-occur. In 
embedded clauses, however, focus may precede the wh-word, but never follow it. 
These facts led Rizzi to conclude that in main clauses, the wh-word targets the 
SpecFoc position.” The focus and the wh-word thus compete for the same position 
and may not co-occur. 


3.3.1 Embedded wh 

This paper essentially discusses declarative main clauses. It is, however, necessary 
to take a brief look at the word order in embedded interrogative clauses. In an 
embedded context, nothings blocks the co-existence of a focus and a wh-element. 
Rizzi notes that “the wh elements in embedded questions are not forced to move 
to the Spec of FOC, contrary to main questions” In Italian embedded clauses, 
there is evidence that a focus may precede the wh-word. Hence, Rizzi concludes 
that there is a Foc position in front of the wh position (15). 


(15) Embedded clauses: ... Foc ... wh... Fin 


There is, however, evidence that the position of the embedded wh-word is not univer- 
sal. In Modern French, the distribution of wh-words and left dislocated topics indi- 
cates that the embedded wh-word in fact targets SpecForce (Rowlett 2007; Salvesen 
2009). In Old French, however, an embedded wh-word preceding a topic is never 
found, but the inverted order is possible. In (16) the wh-word (underlined) precedes 
the embedded topic (in boldface). As que is not yet a clitic in the old language, this 
may not be interpreted as the result of some sort of cliticisation mechanism. 


(16) ne set souz cielque faire doie 
NEG know under sky what do.1nF should 
“T don't know what on earth I should do” (12th c.,Renart-I, 1631) 


Given the above data, we are forced to conclude that the wh-word does not target 
SpecForceP in embedded clauses, and as such it patterns with Modern Italian 
when it comes to the wh-word in embedded clauses (Salvesen 2009). Building on 
Rizzi’s analysis, I will assume that the position of the wh-word in main clauses is 
SpecFocP, and that it occupies SpecWhP in embedded ones. This gives us the fol- 
lowing structure in main declarative clauses (17). 


(17) Main declarative clauses: ... Foc ... Fin 


2. This holds for wh-words that are moved from the lower part of the clause. wh-words that 
are assumed to be base generated in the CP field, such as perché “why” and come mai “how 
come” occupy a higher position (Rizzi 2001). This is exactly the same difference that is found 
with regard to foci and topics in Beninca and Poletto’s account (Beninca and Poletto 2004). 
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In the next section, I will argue that the finite verb sits under Fin°. 


3.4 Remnant movement and the finite verb 


We have now determined that in declarative main clauses, the wh-word occupies 
the lowest specifier position above FinP, namely SpecFocP. Now, let us once again 
look at the embedded interrogative clause. Even though embedded clauses in Old 
French generally have the structure S-V with an expressed subject (see Vanelli 
et al. (1985); Adams (1987); Vance (1997); Beninca (2006) and numerous others), 
it is quite common to find embedded interrogatives with pro (see example (18)). I 
analyse these clauses as having verb movement to the C-domain. In embedded 
questions where no subject is expressed, an infinitive may intervene between the 
wh-word and the finite verb (18). Recall that the embedded wh-word occupies the 
specifier of the lowest projection above FinP in the system sketched out by Rizzi 
(2001), namely WhP. 


(18) Certes, ne sai que faire puisse 
certainly NEG know what do.INF can 
“T really don’t know what to do? (12th c., Trist-Thom, 535) 


In (18), the infinitive faire occurs between the wh-word and the auxiliary. Cross- 
linguistically, fronting of an infinitive has often been regarded as an instance of 
Stylistic Fronting (Maling 1990; Holmberg 2000; Mathieu 2006). Holmberg (2000) 
has proposed that the stylistically fronted element checks a P-feature, which in 
other words requires the overt realisation of the specifier of the head carrying the 
feature. As such the fronted infinitive is analysed as a head that moves by long 
head movement and which ensures the V2 character of the clause. 

In Old French and Old Italian, this head may be accompanied by its comple- 
ment. In Old French, the complement mon conseill ‘my advice’ precedes the 
infinitive in (19-a), whereas the complement proemio ‘preamble’ follows the in- 
finitive in the Old Italian example (19-b). 


(19) a. se mon conseill croire volez. 
if my advise  believe.1nF will 
“if you will take my advise” (13th c.,Rose, 7846) 


b. Ese fare proemio non vuole 
and if do.nF preamble NEG wants 
“And if he doesn’t want to make a preamble ..” 
(Old Italian, from Franco (2009: 86)) 


3. Holmberg proposes that the EPP feature is split between a D-feature and a P-feature. I will 
return to this split EPP in Section 5.2. 
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A fundamental difference between the two languages is the order in which the ele- 
ments occur. In Old French the XP always precedes the X° (Mathieu 2006), where- 
as in OI, it is the other way around. 

If the stylistically fronted element moves to check a P-feature, it would suffice 
to move one and only one element. It has been generally assumed that the fronted 
infinitive is a head which ensures the V2 character of the clause. If this is so, there 
is however no no reason for the XP (mon conseil) in (19-a) or the X° (fare) in (19-b) 
to move. I will therefore follow Ott (2009), Franco (2009), and Salvesen (2009, 
2011) in assuming that the stylistically fronted element is moved by phrasal move- 
ment rather than head movement. As such fare proemio ‘do preamble’ is to be anal- 
ysed as one constituent, and so is mon conseill croire ‘my advice believe’ 

Franco (2009) analyses the Old Italian construction as an instance of VP 
fronting. This is an analysis that cannot be extended to Old French. If it were in 
fact the VP that were fronted in Old French, the order would be X°-XP, which is 
not. There is, however, independent evidence that in Old French both comple- 
ments and other arguments may be fronted in the vP phase, as proposed for Old 
Italian by Poletto (2006, 2011). In (20), the direct object mon conseil my advice 
has been moved across the infinitive, but arguably not as far as to the IP. In (20) 
the indirect object a home mortel ‘to a mortal mar’ precedes the past participle 
descovert ‘revealed’ but follows the low adverb encore ‘still. According to Cinque 
(1999), encore is one of the lowest adverbs in the IP field. This is a strong indica- 
tion that the indirect object has not evacuated the vP field. 


(20) quant cil li dit ce qu il n’ avoit encorea home 
when this-one him says that what he Nec had still to man 
mortel descovert 
mortal revealed 
“when this man tells him what he has not yet revealed to any living man” 
(13th c., Trist-prose, 170) 


I will analyse mon conseil as having been moved to the left periphery of the vP, 
building on the idea that phases are built the same way at all levels (Belletti 2004). 
If there is a left periphery at the highest level (= CP), there is also a left periphery 
in the vP, and presumably also in the DP (Belletti 2004; Poletto 2006, 2011). If the 
verb has to move to the left periphery in the CP, the verb also has to move to the 
left periphery in the vP (Poletto 2006, 2011). The fronting of an XP in the vP seems 
however optional, presumably because there is no EPP. 

Building on these assumptions, the structure of (20) will then be as in (21). I 
do not name the projections in the left periphery of the vP, as this is not crucial to 
the presentation. Instead I use the neutral expressions XP and X°. 
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(21) [xp mon conseil [,. croire [,p pre [y. ereire [y, 


mon-conseil [,. eroire]]]]]] 
I follow Salvesen (2009, 2011) in claiming that it is this vP that is fronted to the 
CP. This is an instance of Remnant Movement as the vP has been evacuated by 
the subject (pro in this case) prior to movement. This way, the chunk that is 
moved by phrasal movement to the left periphery is much bigger in Old French 
than in Old Italian. 

This analysis implies that what appears to be a head is in fact a phrase. Hence, 
there is no need to postulate that movement takes place twice in the example in 
(21). Its structure will be as in (22). 


(22) se [,, mon conseill croire] volez 


If the infinitive is not in fact a head, but belongs to a phrase moved in front of the 
finite verb, this means that it must occupy the specifier in front of the finite verb. 
We have already seen that there is no projection between whP and FinP. This leaves 
one available specifier, namely SpecFinP. It follows from this that the finite verb 
must occupy Fin® (see also Troberg 2004; Labelle and Hirschbithler 2005; Labelle 
2007; Mathieu 2006; Salvesen 2009, 2011).* This is identical to the proposal made 
for Old Italian by Franco (2009), for Rheto-Romance by Poletto (2002) and for 
Old High German by Walkden (in press). 
We then get the following order of the lower part of the left periphery: 


(23) focus/wh-word [,.._., XP [,... finite verb] ]] 


[roc FinP Fin’ 
3.4.1 The particle si 

In Old Romance there is a particle that clearly belongs to the CP field, namely si, 
derived from Latin sic ‘so. What characterises si is that it competes with the subject 
for the first position of the clause. It is always directly followed by the finite verb: 
no other element may intervene between the two. If the subject is expressed, it fol- 
lows the finite verb. Si either appears after an initial topic or it is in clause initial 
position. It may host right adjoining clitics (Ferraresi and Goldbach 2003). In 
(24-a) there are three coordinated main clauses. The subject la pucele ‘the young 
girl is expressed in the first clause, but not in the following clauses. Instead, si in- 
troduces the two main clauses that are adjoined to the first one. In (24-b) the sub- 
ject follows the finite verb. 


4. Note that Labelle and Hirschbiithler (2005); Labelle (2007) and Mathieu (2006) postulate 
verb movement to Fin® only in the case where an XP other then the subject precedes the finite 
verb. They follow Vance (1997) in claiming that the subject never moves any higher than 
SpecTP/SpecIP (depending on the terminology). 
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(24) a. Lors uient la pucele tot maintenant a Lanselot si 
then comes the girl all now to Lancelot s1 
sageneille deuant lui si li dist... 
kneels before him si him says 
“Then the girl all of a sudden walks up to Lancelot and kneels before 
him and tells him: ...” (13th c., Mort A, p8) 


b. [Au matin] si fu li parlemenz en un vergier a 
on-the morning si was the parliament in a garden of 
Yabaie | madame Sainte Marie de Soissons. 
the abbey madam Holy Mary of Soissons 
“In the morning, there was an assembly in a garden of the abbey of 
Our Lady St. Mary of Soissons.” 
(12th c., Villehardouin, from Ferraresi and Goldbach (2003)) 


The semantic content of si is difficult to define, even though it is often translated by 
‘and’ as in the example in (24-a). However, its properties are different from those 
of a conjunction and it may even co-occur with et ‘and’ (Radwan 2011). Schosler 
and Reenen (2000) have shown that si first and foremost marks topic continuity in 
the text. This is clear in (24-a), where the two last clauses have the same subject as 
the initial clause, namely la pucele ‘the young girl’ 

Several linguists have viewed si as an adverbial phrase occupying a specifier 
position (Adams 1987; Vance 1997; Salvi 2004). Beninca (2006) postulates that si 
is the specifier of a focus projection. 

Both Ferraresi and Goldbach (2003) and Ledgeway (2008) have proposed 
that si is a functional head realising Fin®. Ferraresi and Goldbach assume that 
the finite verb then moves to this position,” whereas Ledgeway proposes that si 
in itself fulfills the V2 requirements by Merge rather than Move. However, 
Ledgeways analysis predicts that one would get the word order si - subject, 
which is in fact not the case. He explains this absence in Old Neapolitan by the 
fact that this language does not project SpecTP. Old French, on the other hand, 
does, so Ledgeway’s analysis cannot be extended to this language without 
modifications. 

Given that si is so closely linked to V2 and that we have established that V2 in 
Old French is related to a particular feature of Fin®, I will however propose that si 
occupies a specifier and that it belongs to FinP rather than to FocP. I will get back 
to si and compare it to the Norwegian particle sd in Section 4.4. 


5. As Ledgeway (2008) has pointed out, this would imply that the finite verb precedes si, which 
does not happen. 
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Given the data discussed here, we may conclude that Old Frenchmain declara- 
tive clauses have activated both the positions in Rizzis system present in the lower 
part of the left periphery: 


(25) ... Foc... Fin... 


3.5 The topics 


Having thus established the order between the elements in the lower part of the CP 
field, let’s turn to the upper part. Recall the outline of the exploded CP, briefly de- 
scribed in (6), repeated here as (26). 


(26) Frame... Force ... Theme... Focus ... Fin 


According to Beninca and Poletto (2004), topics appear in both the Frame and the 
Theme. The Frame consists of a Hanging Topic and a Scene Setting position, 
whereas the Theme is the position for left dislocated elements. This gives us a more 
fine-grained map of the CP field (27). 


(27) Hanging Topic ... Scene Setting ... Force ... Left Dislocation ... Focus ... Fin ... 


Mathieu (2010) proposes that the co-occurrence of a V2 grammar and V3 word 
order is in fact due to the rich left periphery of Old French. He then describes the 
existence of Hanging Topics (henceforth HT) and Left Dislocated topics (LD) 
based on the punctuation and on the existence of resumptive pronouns. His claim 
is that a HT does not require a resumptive pronoun, whereas a LD does. This is the 
inverse of what Beninca and Poletto (2004) describe for Modern Italian. On their 
account, there are a number of differences between a HT and a LD: (1) A HT may 
not be a PP, only a DP, whereas a LD may be both; (2) there may be several LDs, 
but only one HT; (3) LDs only require a resumptive pronoun when they corre- 
spond to direct or partitive objects. HTs always require a resumptive pronoun. The 
LD resumptive pronoun always corresponds in person, number and case, but a 
HT resumptive pronoun only corresponds in person and number; (4) The resump- 
tive pronoun of a LD is always a clitic, whereas it may be a clitic, a tonic pronoun 
and an epithet if it is a HT. 

Yet another difference between the HT and the DL is that HTs are located to 
the left of ForceP, whereas LDs are to its right. Thus a subordinate clause may con- 
tain a LD, but not a HT. 

Beninca and Poletto base their observations primarily on Rhaetoromance and 
Italian dialects, so these descriptions must be handled with care when applied to 
Old French. It seems however reasonable to assume that some of their claims must 
be valid more generally. HT is characterised by the fact that PPs are not allowed, 
only DPs. In the Medieval Romance languages only nominative and accusative/ 
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oblique case are spelled out morphologically; in the case of dative, a preposition 
must be added. This indicates that the HT-ed element must be in its basic form. 
Even when its resumptive pronoun is in the dative case, the HT element will not 
reflect that. I take this as one of the hallmarks in distinguishing between HTs and 
LDs. A HT will always be in nominative case, regardless of the syntactic status of 
the element it corresponds to in the following clause. A LD on the other hand, will 
reflect the case of its corresponding element. 

When it comes to the nature of the resumptive elements in the clause, I believe 
it is impossible to claim that these findings be valid in all languages, at least not 
prior to investigation. Troberg (2004) has shown how the use of resumptive 
pronouns has changed in the history of French. According to her, a resumptive 
pronoun in Old and Middle French could be a pro, a possibility which no longer 
exists in Modern French. Thus there is the possibility that the resumptive pronoun 
is an empty category. 

The following two sections will look at Left Dislocated topics (3.5.1) and 
Hanging Topis (3.5.2) separately and argue that both positions were available in 
Old French. 


3.5.1 Left Dislocation 
Old French clearly has the possibility of using Left Dislocation (28). 


(28) a. Cest chevalier, je ne I aim pas. 
this.oBLIQUE knight.oBLIQUE I NEG him.acc love NEG 
“As for this knight, I don’t love him” 
(12th c., Erec et Enide, 60, Buridant (2000: 756))) 


b. De cest avoir, certes, nus n avum cure. 
of this wealth surely we NEG have need 
“As for this wealth, surely, wedon’t need it” 
(11th c., Alexis, 532, from Buridant (2000: 745)) 


Old French DPs have morphological case and a binary case system containing a 
subject case and an oblique case. In (28-a) the fronted element is in the oblique 
case, which corresponds to the accusative pronoun le (I’) ‘hin’ in the clause. In 
(28-b) the fronted PP is not followed by a resumptive pronoun. According to 
Beninca and Polletto, only direct or partitive objects require a resumptive pronoun 
in the main clause. In addition prepositions are banned in HD position, which 
makes it even more plausible to regard de cest avoir ‘this wealth’ in (28-b) as a LD. 


3.5.2, Hanging Topics 
There is however reason to claim that Old French also makes use of the HT posi- 
tion (29). The Hanging Topic is normally in the nominative case (hence its Latin 
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denomination nominativus pendens ‘hanging nominative’). This would be the po- 
sition for vocatives. 


(29) Tristanzi, mout vos, a, ame. 
Tristan.NOM very you,.Acc have loved 


“Tristan, I have loved you very much.” (13th c., Trist-prose, 393) 


In (29), the DP Tristanz is in the nominative case, which does not correspond to 
its resumptive pronoun in the main clause, vos. Vos, being a direct object, is neces- 
sarily in the accusative case, even though case is not distinguishable on pronouns 
of the second person plural. The fact that the two do not carry the same case is an 
argument in favour of analysing the construction in (29) as a HT. Also note that 
what follows the HT is a V2 structure, with the adverb mout in first position, fol- 
lowed by the direct object clitic vos and the finite verb ai. 


3.6 Scene Setting 


In the framework of Beninca and Poletto (2004), there is a designated position for 
so-called Scene Setting adverbs in the Frame section. Together, Hanging Topics 
and Scene Setting adverbs compose the Frame of the clause, situated above ForceP. 
Beninca (2006) notes that circumstantial adverbs may appear in Frame, but there 
is no constraint: “whether they receive a Frame, Topic or Focus interpretation de- 
pends on pragmatics”. 

In general a fronted adverb in Old French fulfills the V2 requirement, as 
in (30). 


(30) Lors lieve li criz sor li 
then rise the cry on him 
“Then people started shouting at him.” (13th c., Trist-prose, 69) 


There are a number of adverbials that do not necessarily trigger V2 and that con- 
sequently may co-occur with other fronted XPs. Vance(1997: 61) provides an ex- 
haustive list of these adverbials in the Queste: onques (mes) “(n)ever (more); ja 
(mes) (n)ever (more), neporquant ‘nevertheless, certes ‘certainly; sanz faille ‘with- 
out fail’ as well as different oaths: par foi ‘by faith, par mon chief ‘by my head’ and 
por Dieu ‘for God’s sake? Arguably, these adverbials are in some cases rather par- 
enthetical, as certes in the exemple in (28-b), repeated as (31). 


(31) Decest avoir, certes,nusm avum cure. 
of this wealth surely we NEG have need 
“As for this wealth, surely, we don't need it” 

(11th c., Alexis, 532, from Buridant (2000: 745)) 
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However, in other cases these adverbials must be seen as Scene Setting elements. 
Although we will never know, it seems likely that they have a special intonation 
pattern, as their modern counterparts would have. 


(32) Par mon chief, danz chevaliers, vos parlez assez _ plus 
by my head sir knight you speak enough more 
orgueillousement que mestiersne vos fust! 
proudly than need NEG you was 
“By Jove, sir knight, you speak much more proudly than you should” 
(13th c., Trist-prose, 627) 


A similar construction would be possible in Norwegian, with a particular intona- 
tion pattern (33). There is a clear intonation break before the word du ‘you. 


(33) For guds skyld, # du ma ikke gjore dette. 
“For God's sake, # you must not do this” 


I will assume that these constructions are in fact marked prosodically and that 
they have the possibility of occupying the Scene Setting position. 


3.6.1 Fronted subordinate clauses 

As mentioned in Section 3.2, as many as 19.2% of the examined main clauses in 
Radwan’s study of the Tristan en prose have two elements preceding the finite verb 
(Radwan 2011). In most of these cases — 160 out of a total of 187 — the initial ele- 
ment is a subordinate clause. When the subordinate clauses are set aside, the 
amount of clauses with a superficial V3 word order in Radwan’s study shrinks to 
2.6%. This is coherent with observations made by Rinke and Meisel (2009).° 
A fronted subordinate clause is normally followed either by the subject (34-a), the 
particle si (34-b), or any other XP (34-c). Donaldson (2012b) has examined texts 
from the 10th to the beginning of the 14th centuries and has shown that most 
fronted subordinate clauses are situated outside the main clause proper.’ The ma- 
jority of fronted subordinate clauses are introduced by quant ‘wher’ or se 
‘if? - these two types account for 5935 out of 7054 clauses in Donaldson’s corpus 
(Donaldson 2012a). 


6. Kaiser (2002), on the other hand, does not provide us with any statistics as to the first ele- 
ment in the V3 constructions. 


7. Donaldson (2012b) also shows that during a brief period stretching from the late 12th to 
the early 13th century there is a tendency that the fronted clause may count as the first constitu- 
ent in the main clause. The main tendency is non the less that the subordinate clause appear 
outside the main clause. 
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(34) a. Quant la roine a_ dite ceste parole, ele comence mout 
when the queen has said this word she begins very 
tendrement a _plorer 
softly to cry 
“After having said this, the queen begins to cry softly” 

(13th c., Trist-prose, 924) 


b. Quant nos serons dusques la venu, si vos conterai adonc 
when we be.FuT thus there come, si you tell.FuT then 
tout ce que vos me demandez. 
all that what you me ask 
“When we have arrived there, I will answer all your questions.” 

(13th c., Trist-prose, 888) 


c. Mes se ceste parole ne fust, malli alast prochenement. 
but if this word NEG was bad him went soon 
“But if it hadn't been for these words, something bad would soon have 
happened tohim?”  (13thc., Trist-prose, 260, from Radwan (2011)) 


Fronted adverbial clauses in Old French are thus invisible to the V2 configuration, 
and must belong to the upper field of the left periphery. I will follow Donaldson 
(2012b) in suggesting that they have been merged in the Scene Setting position, to 
the left of the LD position. This analysis is supported by the fact that hanging top- 
ics always precede the fronted clause. 


(35) Tristanz, quant il vos plera, je P avrai 
Tristan when it you please.rurI1 it have.rFuT 
“Tristan, when it pleases you, I will have it” (13th c., Trist-prose, 397) 


In Modern Germanic V2 languages, such fronted adverbial clauses do trigger 
inversion, as in (36). 


(36) a. Da vi kom hjem, gikk vi og la oss. 
when we came home went we and laid us 


“When we came home, we went to bed” (Norwegian) 
b. Als wir zuriickgekommen waren, waren wir mide. 

when we_ back-come were were we tired 

“When we returned, we were tired.” (German) 


As such, the fact that Old French does not have inversion after fronted adverbial 
clauses may be seen as an argument against a V2 analysis. As it happens, Old 
French was not the only medieval V2 language where a fronted adverbial clause 
did not trigger inversion. A fronted subordinate clause was equally “invisible” to 
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the V2 configuration in Old High German (Axel 2007), Middle High German 
(Demske 2012) and Old Saxon (George Walkden, p.c.). All these are considered to 
be V2 languages. In these languages the verb is commonly in third place when an 
adverbial subordinate clause is in first position. In (37-a) the fronted subordinate 
clause is followed by the resumptive element dhar ‘there’ in (37-b) by the subject 
lute ‘people. 


(37) a. Dhuoir himilo garauui  frumida, dhar uuas ih 
when he heavens’ equipment created there was I 
“When he fashioned the heavens, I was there.” 
(Old High German, from Axel (2007: 228)) 


b. D6 mandenhelt reinen tz der stad in daz haz trig 
when one the hero perfect from the town in the house took 
lute volgeten im gentig die alle sére clageten, daz 
people followed him enough who all very mourned that 
sie vorlorn habeten alsé den wigant 
they lost had this-way the brave-warrior 
“When they had taken the hero from the town into the house, a lot of 
people followed him, all mourning very much because they had lost 
the brave warrior this way” 

(Middle High German, Tristrant 1077, from Demske (2012)) 


Just as in Old French the matrix clause in Old High German may be introduced by 
a resumptive element, tho ‘ther’ in (38). 


(38) So  hértho bilanzi sprehhanne/ tho quad her zi simone 
when he then left to speak / then said he to Simon 
“When he had finished speaking, he said to Simon ...” 
(Old High German, from Axel (2007: 232)) 


The fronted subordinate clause is normally an adverbial clause, and it seems justi- 
fied to see them as occupying the Scene Setting position. Axel draws the same 
conclusion when she notes that the “the adverbial clause (...) is base-generated to 
the left of ForceP (...)” (Axel 2007: 229). I thus go one step further in naming this 
position Scene Setting, and I crucially claim that it is the same position that is in 
use, both in the Old Germanic languages and in Old French. 

In his work on Old Norse (also a V2 language) Faarlund (2004) does not 
specifically comment on the fronted subordinate clauses, but his examples show 
that they are often followed by the resumptive element pd ‘then’ (39). This pat- 
terns with the Old French si constructions as well as the Old High German tho 
examples. 
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(39) en er  borir heyrdibettasagt,pa var very annan veg en 
but when Thori heard this said then was much other way than 


hann hugdi 

he thought 

“But when Thorir heard this, it was very different from what he (had) 
thought? (Old Norse, from Faarlund (2004: 235)) 


There is thus strong evidence that a fronted adverbial clause was invisible to the V2 
requirement in the Old Germanic languages at the turn of the first millennium. A 
major shift has taken place regarding the status of fronted clauses in V2 languages 
since then. Based on this, it is not at all surprising that Old French gets superficial 
V3 word order with a fronted subordinate clause. In this respect Old French pat- 
terns with Older Germanic and not Modern Germanic. 


3.7. Interim summary 


In the discussion above we have seen that Old French V2 clauses have verb move- 
ment to Fin®, and that we have different types of topics. Just as originally proposed 
by Beninca and Poletto (2004), there is a Frame to the left of ForceP, consisting of 
two positions: The Hanging Topic and the Scene Setting position. Then there is a 
lower topic position, namely the Left Dislocated topic. In Beninca and Poletto’s 
framework, this is the Theme. Further to the right there is a focus position and 
FinP (40). 


(40) The makeup of the left periphery in Old French 
ATP...ScSP...ForceP...LDP...FocP...FinP... 
cece jets —— 


FRAME THEME 


Our investigation of the left periphery suggests that there is a major difference 
between these positions and their relation to the V2 requirement. Only elements 
occupying SpecFocP or SpecFinP do in fact count as the first element in a V2 con- 
figuration. The reason for this may be that only these elements have been moved 
there, whereas the others have been base-generated there, as proposed by Cinque 
(1990); Beninca and Poletto (2004), and Beninca (2006).° The idea is exemplified 
in (41). This way the presence of a topic will induce V3 word orders, in some cases 
even V4. This is a strong hypothesis on topicalisation, and I will return to it in 
Section 4.1. 


8. Note that there is one apparent exception to this rule: the particle si which is probably base- 
generated in SpecFinP. 
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(41) HTP...ScSP...ForceP...LDP...FocP...FinP... 
—————_ 


base generated elements moved elements 


This conclusion is identical to the one that Poletto (2002) has drawn from Modern 
Rhaetoromance. Walkden (in press), who has studied Old Saxon and Old High 
German, also proposes that in Proto West Germanic the verb moves to Fin® and 
no further, thus opening up for V2 and V3 structures. Walkden claims that “the 
generalization of verb movement to Force®, and hence of V2, in OS [Old Saxon] 
and later OHG [Old High German] could plausibly have spread as a single wave of 
diffusion across the Continental West Germanic area.” 

The consequence of Walkden’s observation is that all Modern Germanic V2 lan- 
guages have verb movement to Force®, which then would account for the rarity of 
V3 constructions. In the next section I will however argue that this is not the case. 


4. ‘The left periphery of Modern Germanic 


It is often said that the left periphery of the Germanic languages is less rich than 
the Romance one. In the following I will show that there are two different topic 
positions in Norwegian and German, but that the use of topics is more restricted 
than in the Romance languages. It is less clear that these languages make use of a 
Scene Setting position. 

Whereas the structure of embedded clauses has been extensively studied, less 
attention has been payed to the structure of the CP of main declarative V2 clauses. 
It seems however that there is a tendency to claim that Force’® is the landing site of 
the finite verb, and that this head carries an EPP that ensures that the finite verb in 
fact ends up in second place in the clause. Ostb@ (2007) makes this assumption 
when discussing the sd construction in Norwegian (see Section 4.4), and 
Frascarelli and Hinterholzl (2007) conclude similarly when examining German 
data.’ The null hypothesis is also that the complementiser in the Scandinavian 
languages sits in Force® (Wiklund et al., 2007; Lohndal 2009). If the general analy- 
sis of asymmetric V2 languages is to be taken at face value, the site of the comple- 
mentiser must be identical to the landing site of the finite verb. The reason for this 
is that the hallmark of asymmetric V2 languages is the main-embedded asymme- 
tries. An asymmetric V2 language such as Norwegian has V2 in main clauses, but 
SVO in embedded clauses. This asymmetry is caused by the the fact that the 
finite verb in main clauses target the same position as the complementiser in 
embedded ones, which means that verb movement to the CP field is blocked in 


9.  Frascarelli and Hinterhdlzl have a very different approach to these questions, and I will not 
consider their analysis furthermore. 
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subordinate clauses. Consequently, if the complementiser targets Force®, this must 
be the landing site for the finite verb in main clauses. 
4.1 Left dislocation 


It is however quite obvious that the Germanic languages accept some sort of topics 
preceding the finite verb (42). 


(42) Dieter, den kenne ich nicht. 
Dieter, that.one.acc know I. NEG 
“Dieter, I don't know him” (German) 


The question is: What kind of topic is this? Is ita HT or a LD? Recall that Beninca 
and Poletto (2004) state that one of the main differences between HT and LDs is 
that HT only corresponds in person and number, not in case. The example in (42) 
is ambiguous, as it is not clear what case Dieter has. If preproprial determinants are 
added, the picture gets clearer (43) (examples from Jens Haugan, pc.). 


(43) a. Den Dieter, den kenne ich nicht 
the.acc Dieter, thatone.acc know I NEG 
“Dieter, I don’t know him” (German) 
b. *Der Dieter, den kenne ich nicht. 
the.Nom Dieter, that.one.acc know I NEG 
“Dieter, I don't know him” (German) 


The topic must display the same person, number and case as the resumptive pro- 
noun. Dieter must be accusative as in (43-b) and not nominative as in (43-c). This 
is consistent with observations made by Grohmann (2000). 

If a LD carries the same person, number and case as the resumptive pronoun, 
these examples are clear instances of Left Dislocation. The same pattern emerges 
in Icelandic, where the morphology is spelled out on the DP (44) (examples from 
Gunnar Hrafn Hrafnbjargarson, p.c.). In order to be topicalised in the context 
below, Sigga must carry the accusative case as it corresponds to the pronoun hana, 
which is a direct object of the clause. 


(44) a. Siggu, hana __ pekki ég ekki. 
Sigga.acc, her.acc know I NEG 
“Sigga, I don't know her.’ (Icelandic) 
b. *Sigga, hana __ pekki ég ekki. 
Sigga.Nom, her.acc know I_ NEG 
“Sigga, I don’t know her.’ (Icelandic) 
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As for the Mainland Scandinavian languages, where morphology is poorer, it is 
more difficult to tell. However, both Eide (2011) and Faarlund (1992) regard ex- 
amples like the ones in (45) as instances of Left Dislocation rather than Hanging 
Topics. In these examples there is no pause (#) between the fronted element and 
the resumptive pronoun. 


(45) a. Mora mi, hu gikk pa skolen med Jan Terje. 
mother.the my, she went to school.the with Jan Terje 
“My mother, she went to scool with Jan Terje.” 
(Norwegian, eastern dialect) 


b. Meine Mutter, die ist mit ihm zur Schule gegangen. 
my mother she is with him to.the school gone 
“My mother, she went to school with him” (German) 


All these examples permit embedding under bridge verbs, which indicates that the 
fronted element is situated to the right of ForceP. 


(46) a. Han fortalte meg at mora mi, hu gikk pa skolen 
he told me _ that motherthe my, she went to school.the 
med Jan Terje. 
with Jan Terje 
“He told me that my mother went to school with him?” 
(Norwegian, eastern dialect) 


b. Er erzahlte mir, dass meine Mutter, die ist mit ihm zur 
he told me that my mother she is whit him to-the 
Schule gegangen. 
school gone 
“He told me that my mother went to school with him” (German) 


These data all point in one direction, namely that these topics are left dislocated 
topics. It is however quite clear that these constructions do not affect V2 - the 
topics exist in addition to V2, just as they did in Old French. We are thus forced to 
anticipate the following structure of the left periphery - at a minimum. 


(47) Left dislocation ... resumptive element... finite verb 


I will come back to the question of defining the positions for the resumptive ele- 
ment and the finite verb in Section 5.2. 
4.2 Hanging Topics 


The starred examples above ((48-a) and (48-b)) may be acceptable if there is a 
substantial pause after the initial element (48). 
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(48) a. Der Dieter # den kenne ich nicht. 
the.nom Dieter # that.one.acc know I NEG 
“Dieter, I don't know him” (German) 


b. Sigga # hana  pekki ég ekki. 
Sigga.Nom # her.acc know I NEG 
“Sigga, I don’t know her” (Icelandic) 


The presence of an intonation break has been taken to be a diagnostic tool for 
distinguishing between HTs and LDs (Faarlund 1992; Grohmann 2000; Eide 2011). 
The fact that the examples in (48) combine an intonation break with lack of cor- 
respondence in case corroborates this hypothesis. 


4.3. Verb movement to Fin® 


We have seen evidence that strongly suggest that the Modern Germanic languages 
make use of the Left Dislocation position (LDP). This implies that the verb, occur- 
ring to the right of these topics, cannot sit under Force®. Below LDP there are two 
possible landing sites: Foc® and Fin®. The first element may be a contrastive focus, 
but it is not necessarily so. In (49-a) pda Toten ‘at Toter’ is not marked in any way. In 
(49-b) is it possible to add phonetic stress in order to focalise pd Toten.The con- 
struction in (49-c) is perhaps the most common, with the expletive subject det ‘it’ in 
the clause’s first position. It is thus impossible to claim that the first position in the 
V2 clause is marked unless specific stress is added. Faarlund (1992: 90) claims that 
the first position in Norwegian is thematic. I will come back to this in Section 5.3. 


(49) a. Pa Toten regner det. 
on Toten rains it 
“Tt rains at Toten” (Norwegian) 


b. PA ToTEN regner det, men ikke pa Eidsvoll. 
on Toten rains it but not on Eidsvoll 
“Tt rains at Toten, but not at Eidsvoll” (Norwegian) 


c. Det regner pa Toten. 
it rains on Toten 
“Tt rains at Toten” (Norwegian) 


It does not seem possible to claim that the first element in a V2 clause sits in 
SpecFocP as this makes the wrong predictions about the interpretation of the first 
element. In (49-a) pa Toten ‘at Toter’ does not carry any particular stress, and it 
seems unreasonable call it a focus. This in contrast with (49-b), where pa Toten ‘at 
Toter’ clearly carries contrastive focus. In (49-c) the initial element det ‘it’ is an 
expletive, which may not be interpreted as a focus either. 
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Based on these observations, I will claim that the unmarked landing site for 
the finite verb is Fin® and that the unmarked first constituent moves to SpecFinP. 
It is possible that pd Toten ‘at Toter’ in (49-b) is in fact in SpecFocP. If this is cor- 
rect, we see the exact same pattern as in Old French and in Rhaetoromance 
(Poletto 2002) where only si and the moved elements in the CP field may fulfill the 
V2 requirements. 


4.4 Fronted elements and the si/sd construction 


As mentioned earlier 3.4.1, there is a striking similarity between Mainland Scan- 
dinavian (MSc) and Old French, namely the possibility of inserting a sentence 
particle si (Old French) and sé (MSc) after a fronted adverb or adverbial clause 
(see Egerland and Falk (2010) for an extensive comparison). 


(50) a. Quant li rois ot ceste parole, si retret sa men 
when the king heard this word si withdrew his hand 
arrieres 
backwards 
“When the king heard these words, he withdrew his hand” 
(13th c., Trist-prose, 252) 


b. Da  kongen hadde hort dette, sa trakk han handen til 
when king-the had heard this sA drew he hand.the to 
seg. 
himself 
“When the king had heard this, he withdrew his hand.” (Norwegian) 


The two clauses above share the same structure: 
(51) adverbial clause: ... s1/sA ... finite verb. 


Construed this way, the clauses do not have a V2 structure, but a superficial V3 
structure. The construction is quite common. In Radwan’s corpus of 1041 Old 
French clauses from the Roman de Tristan en prose (13th century), 56 clauses con- 
tain the particle si (Radwan 2011). As for Norwegian, Faarlund et al. (1997) stress 
that it is more common in spoken than in written language. There is no semantic 
difference between clauses with sd and the corresponding clauses without the par- 
ticle. The initial clause cannot be an argument of the clause, neither in Old French 
nor in Norwegian (Faarlund 1992) and Old French (Ferraresi and Goldbach 2003). 
The initial element is not necessarily a CP, it may also be a plain adverb. If the Old 
French clause contains an overt subject (52-b), this follows the finite verb, just as 
in Norwegian (52-c). 
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(52) a. A LT endemain si s acorderenta ce qu il se 
on the next.day si REFL agreed on that that they REFL 
departiroient les uns des autres 
would.leave the ones ofthe others 
“They decided that they would leave each other on the following day.’ 
(13th c., Queste, 29) 


b. de celui pais meismes fui ge nez, et si me crestiena 
of this country self was I born and si me baptised 
Joseph d' Abrematie 
Joseph of Arithmatea 
“I was born in this country, and Joseph of Arithmatea baptised me 
there” (13th c., Tristan, 28) 


c. Egentlig sa er jeg litt  torst. 
really sA am I little thirsty 
“Tm actually a bit thirsty” (Norwegian) 


In Old French, the topic preceding si may be a topicalised DP which is co-indexed 
with the subject (53-a). This is not possible in Norwegian 


(53) a. Gorvenal, qui mout estoit saiges et qui mout se _prist 
Gorvenal who much was wise and who much REFL took 
garde de maintes choses, si se _ prist tant garde de 
guard of many things, st REFL took guard of the 


la roine... 

queen 

“Gorvenal, who was very wise and took notice of a lot of things, was so 
aware of the queen ...” (13th c., Trist-prose, 251) 


b. *Gorvenal, som var sveert klok, sa oppdaget ... 
Gorvenal who was very wise sA discovered 
“Gorvenal, who was very wise, discovered.” (Norwegian) 


If si/sd is omitted, Old French will normally have an overt subject between the 
fronted clause and the finite clause (54-a). In Norwegian this construction is im- 
possible, and the subject needs to follow the verb (54-b). 


(54) a. Quant la demoisele vit ceste chose, ele sescrie a haute voiz 
when the miss saw this thing she REFL.cry by high voice 
“When the young woman saw this, she started to scream out loud” 
(13th c., Tris-prose, 247) 
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b. Da  jenta sa dette, (*hun) skrek hun hoyt. 
when girlthe saw this she screamed she loudly 
“When the girl saw this, she screamed loudly” (Norwegian) 


‘The two languages behave thus in different ways with regards to the fronted clause 
as sketched out in (55). 


(55) Old French 
a. clause - si —finite verb — (subject) 
b. clause - subject — finite verb 


(56) Norwegian 
a. clause - sd — finite verb-subject 
b. clause — finite verb — subject 


As we have already seen in Section 3.6.1, a fronted clause does not necessarily trig- 
ger inversion in Old French, contrary to Norwegian. This strongly suggests that 
fronted clauses in Norwegian are moved there, i.e. they occupy either SpecFinP or 
SpecFocP.!° 

What is important to retain is that Old French and Mainland Scandinavian 
both have a CP particle, namely si/sd, which may follow a fronted adverb or an 
adverbial clause and give rise to V3 constructions. 


5. The Topics 


We have seen that both in Old French and in some Modern Germanic languages, 
the finite verb does not move any higher than to Fin®. In fact, the left periphery of 
the languages are more alike than one might think. These are the positions we have 
described for Old French (57-a) and Norwegian (57-b). 


(57) a. Old French: Hanging Topic ... Scene Setting ... Left Dislocation ... 
Focus ... 
b. Norwegian: Hanging Topic ... Left Dislocation ... Focus ... 


More important than the exact positions is that the languages behave in the same 
way: Topics are invisible to the V2 configuration and only elements in SpecFocP or 
SpecFinP may fulfill the V2 requirement. 

When discussing topics in Section 4, we saw that the left dislocated element had 
to have the same case as the resumptive pronoun. This is how we can conclude that 
the fronted XP is in fact a LD. Now consider (43-a) and (44-a), repeated as (58). 


10. As for Norwegian, Ostbe (2007) postulates that the finite verb in ordinary declaratives 
moves to Force®. According to her, sé occupies Foc° thus blocking verb movement to Force’. The 
element preceding sd is in SpecForceP, sd in Foc® and the finite verb in Fin®. 
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(58) a. Siggu, hana _ pekki ég ekki. 
Sigga.acc, her.acc know I NEG 


“Sigga, I don’t know her.” (Icelandic) 
b. Den Dieter, den kenne ich nicht 

the.acc Dieter, that.one.acc know I NEG 

“Dieter, I don't know him” (German) 


In both of the examples, the LD is immediately followed by the resumptive pro- 
noun. In fact, this is the only word order that yields an acceptable result with a LD 
topic (59). 


(59) a. *Siggu, ég pekki hana ekki 


*Sigga.acc, her.acc know I NEG 

“Sigga, I don’t know her.’ (Icelandic) 
b. *Den Dieter, ich kenne den nicht 

the.acc Dieter, that.one.acc know I NEG 

“Dieter, I don't know him” (German) 


The only way to render the clauses acceptable, is by inserting a long pause between 
the fronted element and the first constituent of the V2structure. This pause must 
be substantially longer than the one in (59), and I assume that in such examples 
the fronted element is no longer a LD, but a HT (Eide 2011). The only grammatical 
word order is the one illustrated in(60). 


(60) [4 XP, [einp XP; VI] 


Based on this, we can make the following claim about Germanic clause structure 
(61). The observation is in the spirit of the one made by Grohmann (2000). 


(61) A left dislocated topic can only occur iff it is immediately followed by its 
resumptive pronoun. 


Engdahl (1997) has made a similar observation for Swedish matrix clauses. Ob- 
serve the way the clauses in (62) are bound together (examples in (62) from 
Engdahl). 


(62) a. Igar képte jag en ny jacka. 
yesterday bought I a new jacket 
“Yesterday, I bought a new jacket.” (Swedish) 
b. Den ska jag hae, pa mig ikvall. 
it willl wear e on me_ to-night 
“T will wear it tonight.” (Swedish) 
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When an item is introduced in clause A (en ny jacka ‘a new jacket’ in (63-a)), it 
may be resumed by a fronted pronominal topic in clause B (den ‘it’). It is not neces- 
sary to pronounce the clause in (62-b) with any particular stress on the fronted 
pronoun.'! Engdahl labels this continuous topics and claims that “the speaker uses 
the fronting as a way of establishing cohesion between the two utterances”. This 
way of linking clauses is equally common in Norwegian (63). 


(63) a. I gar kjopte jeg en ny jakke. 
in yesterday bought I a_ new jacket 
“Yesterday, I bought a new jacket.” (Norwegian) 


b. Den skal jeg ha pa megi_ kveld. 
it will I wear on me in evening 
“T will wear it tonight.” (Norwegian) 


Even though what Engdahl describes concerns independent clauses, the parallel is 
striking: An element may be immediately followed by a pronoun with the same 
reference. Either a new element may be introduced in a preceding clause as in 
(63-a), or the fronted element may be a left dislocated topic in the left periphery of 
the clause. In either case, the element preceding the pronoun does not belong to 
the V2 scheme of the following clause. 

In the Romance languages, there is no such requirement. In (64-a) the resump- 
tive pronoun gli, corresponding to the LD di Mario ‘of Mario’ follows the subject in 
the clause. It is the same in French, where the resumptive le ‘him’ follows the subject. 


(64) a. A Mario, gli amici gli hanno fatto un brutto scherzo. 
to Mario the friends to-him have done a bad joke 
“Mario, his friends played a bad joke on him” 
(Italian, from Beninca and Poletto (2004: 64)) 


b. Son frére, je ne le connais pas. 
his brother I NEG him know’ NEG 
“T don’t know his brother” (French) 


What obviously complicates the matters is that the pronominal object in the Ro- 
mance languages is a clitic and may not appear clause initially in declarative claus- 
es. But pronominal clitics arguably also exist in Germanic languages. In Mainland 
Scandinavian, pronouns of the third person singular have clitic forms (Eide 2011). 
The Norwegian pronoun a ‘she’ and it s Swedish counterpart ‘na ‘she’ may not 
double a left dislocated structure. (Swedish examples from Ida Larsson, p.c.) 


11. The fronted pronoun may however also receive contrastive stress. I will not discuss that 
here. 
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(65) a. *Anna, a kjenner jeg godt. 
Anna I know her well 
“Anna, I know her well” (Norwegian) 
b. *Anna, jag kjenner ‘na inte. 
Anna I know her NEG 
“Anna, I don t know her” (Swedish) 


In order to mend the clauses in (65), it is necessary to use the strong form henne 
‘her’ (66). 


(66) a. Anna, henne kjenner jeg godt. 
Anna her know I_ well 
“Anna, I know her well” (Norwegian) 


b. Anna, henne kjenner jag inte. 
Anna her know I = NEG 
“Anna, I don't know her” (Swedish) 


This restriction must not be interpreted as a ban on having ‘a/‘na ‘she’ in initial 
position. After a complementiser, the clitic may introduce the clause (67).'” 


(67) Jeghilste pa Anna men‘a_ kjente meg ikke. 
I greatedon Anna but she knew me NEG 
“T said hello to Anna, but she didn’t recognise me.” (Norwegian) 


It must however be noted that these clitic constructions are less clear-cut than 
constructions with full resumptive pronouns. My Norwegian informants react 
quite differently with regards to the acceptability of a LD topic + pronominal clitic. 
However, when the non-clitic form is used, it has to appear immediately following 
the topic. I will not discuss clitics here. 


5.1 Occupying ForceP? 


Eide (2011) discusses similar examples in Norwegian and proposes that the re- 
sumptive pronoun is base-generated in Force®, and the preceding topic fills its 
specifier. The finite verb sits under Fin®. A clause such as the one in (68) will thus 
have the structure in (69). 


(68) Boka, den fikk Per av mora Si. 
book-the, it; got Peter of mother-the his 
“The book, Peter got it from his mother” (Norwegian) 


12. Thanks to Marit Julien for pointing this out. 
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(69)  [porcep BOka [ den [ re kk les Per av mora si]]]]] 


Force® FinP [ 
The major objection to this analysis is that the derivation is not clear. It raises in 
particular the question of the EPP and howitis checked. If den ‘it’ is base-generated, 
is boka ‘the book’ moved? If it it had been attracted by the EPP to SpecFinP, it 
would have remained there due to some freezing effect. If it has moved directly to 
SpecForceP, the topic construction is followed by aV1 structure. This is highly 
problematic, becauseV1 does not occur in Norwegian other than in topic drop 
contexts (in correspondence, according to Faarlund et al. (1997)). In (68) all the 
thematic positions are expressed, so topic drop is ruled out. Eide’s analysis also 
makes the wrong predictions about the topic continuity constructions in (62) and 
(63), repeated as (70). If den ‘it’ is in fact base-generated in Force in (68), we 
would expect a similar analysis for (70-b). In this case however, the analysis is 
straightforward: the DP den ‘it’ occupies the specifier of the same projection as 
the finite verb. Why den ‘it should behave so differently in such similar construc- 
tions remains unclear. 


(70) a. Igar képte jag en ny  jacka. 
yesterday bought I a new jacket 


“Yesterday, I bought a new jacket.” (Swedish) 
b. Den, ska jag ha e,; pa mig ikvéll. 

it willl wear e on me_ to-night 

“T will wear it tonight.” (Swedish) 


The left dislocated construction cannot embed under other verbs than bridge verbs 
(71), as Eide points out. 


(71) a. Hansa at den boka, den hadde han ikke lest. 
he said that that book-the it had he NgcG read 
He said that he hadn't read that book’ (Norwegian) 


b. *Jeg angrer pa at den boka, den har jeg ikke lest. 
I regret on that that book-the it have I NEG read 
“T regret that I haven't read that book.”(Norwegian) 


According to Beninca and Poletto (2004), bridge verbs are characterised by their 
capacity to select a full CP as it complement. It then seems reasonable to assume 
that the complementiser under a bridge verb sits under Force. That way this posi- 
tion is rendered inaccessible for the left dislocated construction. It must thus oc- 
cupy a position to the right of ForceP. 

Given these considerations, I will argue for a different analysis, namely the one 
where the full DP sits in SpecLDP and the resumptive pronoun in SpecFinP. I will 
try to sketch out a derivation in the next section. 
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5.2 Moved or base-generated? 


In Section 3 I have followed Cinque (1990); Beninca and Poletto (2004) and 
Beninca (2006), who have argued that topics have been base generated in situ.'° 
This analysis does not seem to hold for the Germanic languages. Discussing data 
from German, Grohmann (2000) concludes that the LD elements do not exhibit 
any Weak Crossover effects (WCO), and they show sensibility to islands. Examples 
in (72) (from Grohmann (2000)) show that the fronted element (seinen Vorgarten, 
‘his front lawn’) must at some point in the derivation have been c-commanded by 
the quantified subject (jeder ‘every and jeder Herforder ‘every Herfordiar). 


(72) a. Seinen Vorgarten, den mag jeder Herforder Burger t,. 


his front-lawn it likes every Herfordian dweller 

“His front lawn, every Herfordian likes.” (German) 
b. Seinen Vorgarten, den; glaubt jeder kann er t, 

his front-lawn it believes everybody can he nice 

schon halten. 

keep. 

“His front lawn, everybody believes he can keep pretty’ | (German) 


The same effects arise in Norwegian, where sin ‘his’ is obligatorily bound by the 
quantified subject (in opposition to the unbound form hans ‘his’). 


(73) a. Hagen sin, den, liker alle totninger th: 
garden his it like all people-from-Toten 
“All people from Toten like their garden” (Norwegian) 
b. Hagen sin, den, tror alle at de_ kan stelle t, bra. 
garden his it thinks everybody that they can maintain t, well 
“Everybody thinks they can maintain their garden.” (Norwegian) 


Grohmann’s analysis is that the resumptive pronoun is the spell-out of a copy 
(Grohmann 2000). This way it is the LD-ed element that carries the theta-role in 
the clause, and the resumptive pronoun is just a copy from the process of leftwards 
movement. This means that the clause in (74) must have the derivation in (75). 


(74) Diesen Satz, den mag ich besonders. 
this sentence it like I especially 
“This sentence, [it] I like especially” (German, from Grohmann (2000)) 


13. Beninca opens up for the possibility that LDs have been moved, but does not adopt this 
view. 
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(75) The derivation 
[diesen Satz] [diesen-Satz — den] [mag] [ich] frag} [besonders] {diesenSatz} 


Grohmann (2000) uses a different set of labels of the positions in the left periphery. 
Transferring his analysis to our model, diesen Satz ‘this sentence’ must be a Left 
Dislocated topic, and the resumptive pronoun den it’ must be in either SpecFocP 
or in SpecFinP. As it is impossible that an element is a focus and a topic at the same 
time, den ‘it’ must be in SpecFinP, which is the default landing place.This gives us 
a structure as the one in (76). 


(76) [,pp diesen Satz [,,,» den ...]] 


Grohmann does not discuss why the copy is spelled out, probably at PF. A way to 
deal with this is to use Holmberg (2000)’s split EPP (the Extended Projection 
Principle) approach, where he postulates that the EPP is made up of two compo- 
nents: [D] and [P]. The [D] feature is categorical and must be checked by a 
D-element (a DP or a clitic). The [P] feature requires visibility, in other words that 
the element deleting it must be phonologically realised. 

The deleted copy (diesen Satz ‘this sentence’) in German carries a [D] feature 
and is as such a candidate for fulfilling the EPP. At PF it gets spelled out in order to 
meet the EPP criterion, in what strongly resembles a last resort operation. As no 
other candidate is available as the first element in the clause, the deleted copy is 
spelled out in the form of a resumptive pronoun, presumably because the use of 
the full DP would be a violation of the biding principle C (77). 


(77) Binding principle C: An R-expression is free. 


It is quite obvious that Left Dislocated topics behave differently in Modern German 
and in Old French. Consider (78) (repeated from (28-a)) and compare it to (78-b). 
In the Old French example, the LD topic is followed by the subject, whereas it is 
followed by the resumptive element in German, just as we stated in (61). Hence, 
the rule of adjacency between the LD topic and its resumptive pronoun does not 
apply to Old French. 


(78) a. Cest chevalier, je ne T aim pas. 
this knight I NEG him love NEG 
“As for this knight, I don't love him?” (12th c., Erec, 60) 


b. Diesen Satz, den mag ich besonders. 


Regardless of the origin of the LD topic, the derivation cannot be the same in the 
two examples. Following Grohmann’s analysis, diesen Satz ‘this sentence’ has 
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passed through SpecFinP, which only allows for one element to move through it 
(cf. Robert’s bottleneck approach (Roberts 2004)). If indeed cest chevalier ‘this 
knight’ has been moved, not merged, it cannot have moved through SpecFinP, as 
this position is already occupied by the subject je ‘T. This leaves us with one pos- 
sible analysis for a moved topic, namely that it moves by A’ movement directly to 
SpecLDP without touching base in SpecFinP. This way a moved topic does not 
interfere with the V2 character of the clause. Still under the assumption that topics 
are moved to their surface position, this movement would resemble ordinary top- 
ic fronting in Modern French (or Modern English for that matter), where a topic 
precedes the subject without inversion, as in (79). 


(79) Ce chevalier, je ne Taime pas. 
this knight I NEG him love NEG 
“As for this knight, I don't love him” (Modern French) 


We then have two different explanatory models for the origin of Left Dislocated 
topics in Old French. They are either a) merged in situ orb) moved by A movement 
skipping the bottleneck of SpecFinP. In order to conclude, we need more knowl- 
edge about Q-expressions in Old French, and I leave that to future research. 

No matter whether the topics in the topic field to the right of Force have been 
moved there or merged there, the discussion above leads to one conclusion con- 
cerning the V2 construction: Elements in the topic field are invisible to the V2 
derivation. This way they must co-occur with an element situated below the topic 
field in order to generate a grammatical utterance. 


5.3 The informational value of the fronted element 


Now, regardless of the discussion of move and merge, why is it so that the resump- 
tive pronoun need not be adjacent to the left dislocated noun in Old French? Or, 
turned upside down: Why is it necessary for the same topic to move through 
SpecFinP in Modern Germanic? The key word is information structure. 

The first position in Modern Germanic is normally thematic. Discussing 
Norwegian, Faarlund (1992: 90) writes that the first position of the matrix de- 
clarative clause is thematic in the sense that it represents aboutness. Even though 
it may host the dummy pronoun det ‘it, it may be seen as thematic, representing 
the background. It must be stressed that even though the initial word of the clause 
is thematic, it does not necessarily mean that it is a topic. The discussion in 4.3 
revealed that the first element in Norwegian may be a contrastive focus, an exple- 
tive or some sort of theme. In this case it is necessary to distinguish between dif- 
ferent degrees of topicality. There is the relaxed aboutness-kind which does not 
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bring any particular informational value to the clause. It is a theme, but not a 
topic in the sense we have seen in this paper. In (80-b) der ‘there’ refers to pa 
Toten ‘at Toten’ It picks up the preceding element, as such it is what Engdahl 
(1997) calls a continuous topic, but it is not topicalised in the sense that the atten- 
tion is drawn towards the adverb. If one wants to draw the attention towards the 
location of the bad weather in the examples in (80), it is necessary to use a Left 
Dislocation, as in (80-c). 


(80) a. Det regner pa Toten. 
it rains on Toten 
“Tt is raining at Toten” 


b. Der er det mye diarlig veer. 
there is it much bad weather 
“The weather is often bad there.” 


c. Pa Toten, der er det mye darlig ver. 
on Toten, there is it much bad weather. 
“As for Toten, the weather is often bad there.” 


We have already concluded that the finite verb moves to Fin® in Norwegian. The 
fronted element normally occupies SpecFinP, unless it carries contrastive focus, 
in which case it sits in SpecFocP. I will make the claim that SpecFinP is associated 
with a default value, set as +Thematic. This is in accordance with the analysis put 
forward in Faarlund (1992). Basing my analysis on this, the fronted element 
in both (80-a and b) sits in SpecFinP. In (80-c) it is preceded by a topic in 
SpecLDP. 

It is this default value of the element in SpecFinP that is the source of the 
topicality requirement: There cannot be two different topics in the clause, so the 
Left Dislocated topic must necessarily be co-indexed with the inherently thematic 
first element of the clause. In Grohmann’s model this makes sense too - the 
thematic and topicalised element is moved through SpecFinP to the SpecLDP. 

In the Romance languages, the first position is not reserved for elements car- 
rying a certain information value; it is essentially reserved for the subject of the 
clause. In this respect, Old French occupies an intermediate position. Even 
though Vanelli et al. (1985) suggest that the first position in the Medieval 
Romance V2 languages is thematic, Radwan (2011) shows that the information 
value of the first element in a V2 construction is equally distributed between the 
categories known, unknown and neutral. As such it may not be claimed that the 
first position in the Old French V2 clause is essentially thematic. A consequence 
of this is that there is no contradiction between a fronted topic and a 
non-resumptive element. 
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The difference in the topic constructions in Modern Germanic and Old French 
may thus at the end of the day have to do with the information structural value of 
the first position of the clause. An initial thematic element puts more restrictions 
on a left dislocated topic than a non-thematic first constituent. 

This in turns describes the dual status of Old French very well. From a prag- 
matic point of view, Old French patterns with modern Romance languages in 
that the first position in the clause is not necessarily a topic. From a structural, 
syntactic point of view it patterns with the Modern Germanic V2 languages. It is 
the tension between these two systems that generates word orders where more 
than one element precedes the finite verb. It would be interesting to see if this 
could be seen as a trigger for the loss of V2 in French, a question I leave for fu- 
ture research. 


6. Conclusion 


I have described the left periphery of Old French, and I have compared it to the left 
periphery of Modern Germanic V2 languages. The data indicate that the left pe- 
riphery in Germanic is richer than often assumed. It is possible to discern a posi- 
tion for left dislocated topics in the CP field, a position that is situated between 
FinP and ForceP. 

Discussing the available data, I have concluded that the finite verb moves 
no higher than to Fin° in declarative main clauses in both Old French and 
Modern Germanic. The SpecFinP position has a default +Thematic value in 
Modern Germanic languages, but in Old French the SpecFinP position does not 
seem to be associated with a particular information value. 

In both Old French and Modern Germanic, all kinds of topics are invisible to 
the V2 structure. In the Germanic languages a Left Dislocated Topic is moved 
through SpecFinP, and it is a copy from the chain that is spelled out as a first ele- 
ment in the V2 clause. We do not have enough evidence to conclude whether or 
not Old French LD topics have been moved or merged in Old French. If they have 
been moved, they must have moved by A’ movement, skipping SpecFinP. 


Appendix: Cited texts 


Examples from database 


Cited texts found in the Corpus de la litterature medievale (Lalou et al., 2001) on- 
line data base. 
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Alexis Anonymous: Vie de Saint Alexis (mid 11th c.) 

Erec Chretien de Troyes: Erec et Enide (ca 1170) 

MortA Anonymous: La Mort le Roi Artu (ca 1230) 

Renart-I Anonymous: Le Roman de Renart (1175-1250) 

Rose Guillaume de Lorris & Jean de Meun): Roman de la Rose (12251240, 1275) 
Thébes Anonymous: Le Roman de Thebes (ca 1150) 

Trist-prose Anonymous: Le Roman de Tristan en prose (beginning of the 13th c.). 
Trist-Thom Thomas: Le Roman de Tristan (1172-1175) 

Trubert Douin de Lavesne: Trubert (12th c.) 


Examples retrieved manually 


MortArtu Anonymous: La Mort du roi Arthur (ca 1230), éd David Hult, Paris 2009. 
Queste Anonymous: La Quéte du Saint Graal (1215-1230), ed. Bogdanow & 
Berrie, Manchester 2004 
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Language change 


The developmental logic of the analytic 
past in German and Polish 


An issue of universalism or areal contact? 


Werner Abraham 


Universities of Vienna and Munich 


Other than Russian, but to all appearances like Czech, Polish is developing 

a new auxiliary from mied “have, possess” that it can use for the hitherto 
unknown analytic past. We aim to observe what the status of this innovation 
in Polish is: whether or not it is to be tracked back to a contact phenomenon 
induced by neighboring German; whether its distributions are accompanied, 
or even triggered, by erosion of lexical aspectual and morphological oppos- 
itions; whether or not it follows the same grammaticalizing path as medieval 
German; whether it cuts the same portions in the lexical inventory of verbs on 
the criterion [+motion] yielding szin/byt’# HABEN/miec-pasts. One may duly 
conclude that the appearance of the new Polish periphrastic past is not due to 
contact influence from German. Its grammaticalizing steps are in part consonant 
with those of the early development in German. We shall draw consequences 
from specific differences in the Polish and German developments, and we shall 
try to sketch the unifying component.! 


1. What’s new: The emergence of a novel analytic past tense in Polish? 


Today’s Slavic tense systems are devoid of an analytic past. Tense is generally in- 
tricately fused with aspect, which is pervasive in the Slavic languages. Thus, there 
are only three tenses: present, past, and future. The past is usually expressed in 
what is an old past participle form, of which the erstwhile auxiliary encoding 


1. ‘This paper was written in the framework of the bi-national research project “Un-Cartesian 
Linguistics/ Un-Cartesianische Linguistik” (project leaders E. Leiss/Munich and W. Hinzen/ 
Durham, DFG- LE718/ 9-1) financed by the German Deutsche Forschungsgemeinschaft 
(German Research Society). - I wish to thank Terje Lohndal and Elly van Gelderen for helpful 
suggestions. 
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person and number is lacking. The future has an analytic formation for imperfec- 
tive verbs. However, for perfective predicates present morphology expresses the 
future. There is no present tense for perfective verbs - for a grammatically good 
reason: The Slavic perfective always expresses the result of a prior incremental 
eventive process. 

The point of departure in the present paper is the observation that an analytic 
past may just be emerging in Polish (Piskorz 2011).? As a matter of fact this ap- 
pears to be happening for the second time in the history of Polish. In Old Polish, 
in consonance with the entire Slavia, a participial past emerged. The accompany- 
ing copula was soon dropped due to the strong propensity in the present tense to 
do without the copula. What has remained to this date is the synthetic past formed 
by the original participle agreeing with the subject argument for number and gen- 
der. Notice that the old analytic perfect never made use of an Aux corresponding 
to HAVE. Have is missing in all of Slavia. 

In contrast to, and on top of, the first analytic past in Old Polish, a new ana- 
lytic past appears to emerge in Modern Polish. This time the past constituent 
makes use of the auxiliary mie¢ “have” as carrier of grammatical information of 
finiteness. Simultaneously and perhaps under an identical trigger, several other 
new analytic resultative constructions emerge using an identical paradigm. 
These innovations appear as predecessors to new tenses that Modern Polish has 
not had so far: the new miec-pluperfect and the equally new miec-future-in- 
the-past. 

Yet, the status of the three novel construals — past, past-past, and future-in the- 
past, all analytically formed - is not clarified in Polish grammar writing. The main 
obstacle to acknowledging the temporal status of the new analytic forms is the fact 
that too many of the distributional facts congeal under aspectual, resultative-per- 
fective constraints. This is strongly reminiscent of the emergence of the analytic 
past in the history of German, in the period of Old High German/OHG (750-1050) 
(see Abraham 1990, 1991a, b; Leiss 2002). 

In what follows I sketch the oncoming mie¢c-Past in Modern spoken Polish 
under the perspective of its grammaticalizing emergence. Comparing this process 
with what we know about the emerging analytic past in OHG the claim will be 
advanced that the Polish mie¢ + participle passive-active construal follows a uni- 
versal pattern which is used, and will be used cross- linguistically to create a new 
structural past category. 


2. When writing this paper (2010) I was not able to read and refer to Piskorz 2012. 
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2. The logic of emergence of the analytic past: German 


The diachronic emergence of the Germanic past, [Aux/HABEN + past/anterior par- 
ticiple/ PP], progressed along the following constraining intermediate steps: 


(1) Only transitive resultative verbs entered the construal such that the PP 
had a resultative/adjectival reading and agreed with the transitive object 
by number and case yielding more or less adjectival readings as secondary 
predications (small clauses); 

(2) the aspectual-resultative constraint was released such that imperfective 
verbs also underwent the new past mechanics —- accompanied by the re- 
analysis of the full verb (of possession) haben “have” as an Aux; 

(3) the transitivity constraint was released; 

(4) anew Aux differential between haben and sein emerged along the criteria 
of verbs of motion and unaccusatives. 


In path-like form, the development can be sketched as follows (Abraham 1990, 
1991a, b; Bybee et al. 1994: 68ff; Leiss 1992: 280; Leiss 2000): 


(5) transitive Perfective/Resultative Present —> transitive Anterior/Past > in- 
transitive Past analytic Past 


In each of the grammaticalizing steps in (5) both a verb and a participle is involved. 
In the initial stage, possibly up to second to last step, the verb has full lexical status 
(habén “possess” and sin “be” as a copula), and the participial verb can only have 
adjectival status. What we have to be aware of when evaluating the different gram- 
maticalizing steps in (5) is the question how the adjectival participle meaning trans- 
fers to the active past meaning of the later tensed past. We shall return to this. 

The chronological steps (ranging from the 9th to the 15th century: cf. Oubouzar 
1974, Gronvik 1986, Abraham 1991, 1999; Leiss 1992, 2002; Eroms 1997, Kuroda 
1999, Kotin 2000, Zeman 2010) are all marked by a strong perfective impact. Not 
until the end state of the grammaticalizing process is reached does aspect cease to 
function as past tense. Imperfective verbs were excluded from entering the ana- 
lytic past construal in OHG. Furthermore, the perfective participle agreed in gen- 
der, case, and number with the direct object, which was an early indismissible 
component of the constituent also (Abraham 1990). In other words, the Perfective 
slot in (6) had to raise to the agreement slot for ACCUSATIVE case. Transitivity 
implied AccusaTIVE assignment and still does in Modern German. 
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(6) V-Comp/I 
DATIVE-goal 


——» ACCUSATIVE- Theme 


a 


GENITIVE-source 


ee ee 


Directional P-ACCUSATIVE 


Sa 


Perfective 


ae 


Imperfective 


—_— 


The historical source of the analytic/periphrastic past is the perfective aspect — 
itself clearly denoting past eventivity in Germanic (as opposed to Slavic, where it 
encodes future eventivity, at least when expressed in the present tense). This is 
signaled by the fact that only perfective/telic verbs entered the early “past” con- 
strual made finite by the Auxiliaries habén or sin/wesan (Abraham 1992, Leiss 
1992: 283, 289). 

One main feature of the emergence of the analytic past tense in old German 
was the fact that the aspectual system bleached and totally disappeared. Inversely 
corresponding to the emergence of past tense, the perfective prefix ga-/ge- was 
subject to a total demise with the result that perfective verbs could no longer be 
distinguished from imperfective ones. In other words, the original ga-/ge-verbs 
fused with the simple stems and lost the original functional opposition. Aspect, as 
expressed in the lexical pairs of opposition, disappeared from the grammar of 
OHG (Leiss 1992: 68ff; 2002a: 12ff.; 2002b: 26-41). Since the initial tense attempts 
in OHG were restricted to perfective predicates, imperfective verbs were not found 
to instantiate this early tensing attempt. See (7a—d) for what may be a due repre- 
sentation of the interim function of the perfective morpheme gi-. [ts = speech act 
time, tr = reference time, in Reichenbach’s terminology] 


(7) a. gi-tuo-t a OHG 
PERF-d0O-PRES.3SG 
“has done/did” 
b. ha-t ge-tan “& Modern German 


have. AUX-PRES.3SG PAST-dO.PARTICIPLE 
“has done/did” 
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te, te, tr 
c. |>>>>>|--------- ... aspectual difference between tuon “be 
(gi)tuo- gituo- doing” and gituon “finish doing”; see (iii) below 


d. te<tr — te<ts: this change (from aspect to tense) is 

i. valid for OHG (7a) with additional bi-implication between the 
2 phases in (7c), te, and te,. 

ii, valid for OHG (7a) with additional bi-implication between the 
2 phases in (7c), but just sharing with (7c) (te, + te,) < tr/ts. 

iii. valid for OHG (7a) with no additional bi-implication between the 
two phases in (7c) as well as loss of tr as the right border of the 
event as in (7c), but replacing tr by ts in te<ts in (7c). 

iv. On the assumption that the erstwhile adjectival participle has to 
develop into an active past participle, we need to start with a cate- 
gory from which both later options can be derived. I take this cat- 
egorial pre-passive status -cum active participle to be an Anterior 
with the following defining components: 


Anterior/Past (participle) is defined by te<tr 

Passive participle is defined by te<tr + te and tr are in bi- 
implicative relation 

Active past participle s defined by te<ts 


Anterior, thus, is a quasi-perfective notation. All it does is define posteriorship, 
but without the mutual implication that characterizes aspectual perfectivity. Both 
later readings of the participle are triggered by the selection of the full verb mean- 
ings in the initial stage of the grammaticalization and the Auxiliaries, haben vs. 
sein, in the terminating phase of the diachronic development. Initially, haben vs. 
sein are full verbs: haben is a pseudo-transitive with accusative object subcatego- 
rization (valence), whereas sein is a copula with clear subcategorizing restrictions. 
As soon as their full verb status changes for auxiliary status, their subcategorizing 
properties also change: haben takes second participles only and becomes the car- 
rier of tense and finiteness features only, while the Aux sein only subcategorizes 
for second participles. The change from adjectival participle in the initial gram- 
maticalizing stage to passive or active participle is contingent upon the emergence 
of Aux status of both verbs and the syntactic fusion of the new Aux, in terms of 
tense and finiteness status only, and the participle as no longer serving a predica- 
tive adjectival function. This growth in structure and loss in lexical meaning can 
be sketched as in (8). 


(8) a. Initial stage: participle is fully lexically adjectival either in sein- intran- 
sitivity or in haben-transitivity with secondary predication. 
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b. Intermediate stage: Anterior status of the second participle predates 
the decision in the terminating stage: from te<tr to te<tr + te and tr in 
bi-implication yields the adjectival passive - from which eventually 
the event passive emerges by Aux subcategorization and three analytic 
pasts (haben for transitives, sein for intransitives, all with the excep- 
tion of verbs of motion, which take sein in perfective clause readings 
and haben in imperfective interpretations). 

c. Terminating stage: participle disambiguation dependent on the 
Aux choice: sein selects adjectival-passive participle, while haben se- 
lects verbal active participle (again exception in line with (8b).) 


Let’s return to (7a—d): It is a semi-formalized representation of the grammaticalizing 
reinterpretation of the erstwhile aspectual predication of the direct object referent. 
The OHG gituot falls into three components: the lexical stem tuo and the two 
grammatical morphemes for perfective aspect (the prefix gi-) and the finitizing 
morpheme /-t/: this is represented by (i) in (7d), assuming that there was a period 
of transition in OHG when the eventive resultative component was felt to be past 
due to (7c) with tel + te2 in (ii). Finally, the eventive property of the perfective 
biphasicness was given up since such biphasicness was no longer encoded by as- 
pect oppositions and the loss of gi-. This resulted in tense properties as in (iii) in 
(7d) -i.e., pure te<ts. The determining step was tr—ts: the newly developed speech 
act time required for the notion of tense as opposed to tr, which was sufficient for 
the notion of the eventive aspect. Aspect is a notion where reference to the speak- 
er and his action in encoding the proposition p in terms of the event properties 
was not required, and tense was achieved by attenuating eventive qualities in favor 
of temporal serialization of the event components. The latter is exactly symbolized 
in (ii) of (7d) above. 

When Leiss (2002b: 31; 34) draws the somewhat puzzling conclusion that the 
first tensed past constructions in OHG were substitute construals for lacking im- 
perfective verbs (“In its inceptive phase, the Perfect is the ‘Imperfective of perfec- 
tive verbs:”) what she means is exactly the eventive- temporal squish in (ii) of (7d) 
above. While (i) in (7d) tolerates only inherently or gi-marked perfective as well as 
transitive verbs to express what was to be expressed by the analytic construal with 
OHG habén “have” as a property achieved in the past on the referent of the direct 
(ACCUSATIVE) object, it ceased to be activated in due process by the attenuating 
presence of the perfective prefix gi-. 

It is to be noted that the fact that only transitives entered into the early tense 
construal follows from the eventive property in (7c). The transitivity requirement 
follows from the fact that the finitizing verb, habén, was a transitive lexically full 
verb governing the accusative like English own. If OHG, along with all Germanic 
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languages, had not focused on the resultative component in the bi-phasic notion 
of perfectivity, the derivation of the past tense would not have been facilitated in 
the first place. Thus, the resultative phase and the development of the past tense 
imply one another. To highlight this is not trivial for the very reason that Slavic 
focuses not on the second, resultative phase of its perfective notion, but on the 
first, incremental phase. By consequence, using perfective verbs in the present de- 
notes the future, not the past as in Germant(ic). sin/wésan (+ participle), on the 
other hand, was the copula predicating the adjectival, or resultative, participle for 
finiteness? The subject is a Patient and determines agreement on the adjectival 
participle. The adjectival participle, as a perfective derivative, always encodes re- 
sultativity following the incremental phase. Its past denotation is only implied by 
eventive bi-phasicness. Since devoid of any speaker-time implication the high- 
lighted resultative component expresses both continuity and secondary eventivity 
after the incremental eventive phase (Oubouzar 1974; Abraham 1991a; Eroms 
1997; Kuroda 1997; Kotin 2000; Leiss 2002b), 


(9) Ir sizzét inthera burgi,unz ir sit giuuatite mit megine fon 
you sit.suByin this town until you are clad.pL with might from 


héhi. 
above 
“You may sit in this town until you will be endowed with power from 
above.” Tatian 244,1 


- plural declension marking on the participial in ir sit giuudtite.pL ‘you-are- clad’ 
as well as the copula sit renders the periphrastic unambiguously adjectival - and 
not verbally tensed. This is the early, still aspectual, stage of the analytic form. 
Adjectivity of the participial form, however, renders the category [+N, +V], a cat- 
egorially mixed form between verb and nominal status. The next stage in the se- 
quence in the grammaticalizing process allowed transitive verbs selecting haben 
“have” to enter the analytic pattern with the adjectival participle yielding second- 
ary object predications. The agreement declension was then between the participle 
and the transitive object - for example, patterned after the Late Latin and the OHG 
examples (Tatian 102,2) in (10a,b). 


(10) a. habé-o arbor-em planta-tum 
have-1sG.PRES a tree-M.SG.ACC planted-PART.M.SG.ACC 
b. phigboum habeta sum giflanzotan, in sinemo uuingarten 


figtree had.3sc as__ planted in his vineyard 


The latter only allowed for a temporal reading on the basis of the adjectival parti- 
ciple reading: Something that is the result of a prior incremental process - cf. (7c) 
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above — comes later and allows for a past interpretation. Still, because of the adjec- 
tival object reading, only perfective-telic verbs entered the construal. It appears 
that this stage in the grammaticalizing process - clearly a prerequisite for the later 
active predication to achieve — anticipated the next step to come: transitive ‘to 
have/possess something,’ equals intransitive ‘be with something,, where the erst- 
while copula be reanalyzes, or more precisely, recategorizes for auxiliary status on 
the grounds that the predicate is no longer restricted to perfectives (Jung 2011). 
Consider again (7c) with te<tr > te<ts: The aspectual characterization te<tr gives 
way to the inclusion of speaker time and thus tensing te<ts (Abraham 1990, 1991b; 
Leiss 1992: 164). The intransitive construal obtaining is relieved of the direct ob- 
ject, which makes it easier to avoid secondary object predication and reread the 
construal as an active event description. See (11) (cf. Abraham 2000). [PREF = 
verbal prefix] 


(11) a. Die Bume ist verbliht.pERF ... ist = copula 
the flower is PREF.bloomed 


b. Das Pferd hat seine Fesseln bandagiert.PERF .. hat = copula 
the horse has his ankles bandaged 


c. Der Wanderer hat 1 Meile zuriickgelegt .. hat = auxiliary 
the wanderer has 1 mile covered 


d. Der Wanderer ist gegangen.IMPERF .. ist = auxiliary 
the wanderer is walked 


e. Die Blume hat gebliiht.mpERF .. hat = auxiliary 
the flower has bloomed 


(11b) is unambiguously predicating the direct object secondarily, with hat as pos- 
sessive full verb, and an aspectual reading. In contrast, (11c) provides a tensed 
reading, with hat no longer as possessive full verb status, but as an auxiliary. In 
(11b) the subject Pferd “horse” cannot be in Agent status, but Wanderer in (11c) 
can. The trigger for the latter reading — active tense — was probably the frequency 
of AGENT subjects instead of ExPERIENCER roles adapted to the reading of posses- 
sive HABEN.What starts out as a biphasic event projection with object adjectival 
attribution terminates as a monophasic speech-time oriented, tensed event projec- 
tion. This holds both for transitive verbs and intransitive ones. Today’s distribution 
of sein “be” vs. haben “have” across the entire verbal lexicon goes back to the erst- 
while distinction of perfective vs. imperfective verbs irrespective of their (in) 
transitivity status. See (12) for an OHG example standing for an unambiguous 
object-passive reading in the sense of the adjectival participle and secondary pred- 
ication in the initial stage of the grammaticalizing process, similarly to Late Latin 
(10) and Modern German (11b) above. 
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(12) Ir den christanum namun intfangan eigut. 
you the Christian.M.sG.ACC name.M.SG.ACC received. M.SG.ACC have 
‘you the Christian name (as a) received (one) have? 
Exhortatio (Zeman 2010: 185) 


Reinterpretation of the participial meaning from adjectival/passive participle ap- 
position to active anterior follows from the distinction between perfective and 
imperfective predicativity. See (13a,b) for the distinction between the “inner” per- 
fective aspect as opposed to the “outer” aspect of imperfectivity. Terminology; <e> 
and <r> are Reichenbach’s (Reichenbach 1947) reference points on the time scale: 
<e> for <event>, <r> for <referent/viewer point>, <s> for <speech act point>; 
<€,¢> for <incremental event phase, e,...,, for <resultative event phase>; see 
Abraham 2009: 288f.). (13a) represents a monophasic-imperfective event type, 
while (13b) represents a biphasic-perfective event type. Notice the terminology 
“phase” as used here, which has nothing to do with “phase” in Minimalism. 


e 
(13) a, wenn nn nnn nnnn 
Moss bsite view ofr, ..., one, ~~~~~, 
(<e . . » 
r undershoots e - “inner viewpoint 
Cincr © result 
b. |<<<<<<<|------------ 
beg ee heen rr view ofr, ..., one, |<<<|------, 
r overshoots e, where 


1<3 : : »> 
Ce xa outer viewpoint 


incr result — 
The change from aspect based predicativity to tensed predicativity can be illus- 
trated as the change from (14) to (15). As far as the concept of aspect is concerned, 
it is assumed that Modern Polish and Old High German have more or less lexical 
aspect, or Aktionsart: There are clear lexical oppositions keeping apart the perfec- 
tive-imperfective meanings (cf. OHG gatuon vs. tuon “complete vs. “do”; napisat’ 
“complete writing” vs. pisat’ “write”); imperfectives are the unmarked, stem-deri- 
ved, lexemes, while perfectives are marked and prefix-derived in Russian, and 
particle-derived in OHG. Such aspectual representatives are therefore merged pri- 
marily as verbals. Secondary, complex perfectives come about by co-construction 
with directional prepositional phrases and with paradigmatic case change both in 
Russian and OHG. See Abraham (1997) and Leiss (2000). This distributional sce- 
nario is behind the decisions for the structural representations in (14) and (15). 
Notice that OHG had not yet reference-determining articles, on a par with 
Modern Polish. Both (Im)Perfectivel and (Im)Perfective2 can go either way to 
satisfy accusative or genitive cases and thereby achieving either indefiniteness 
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(with genitive) or definiteness (with accusative). In other words, instead of article 
forms, the case differential encodes the referentiality on the object nominal. 


(14) Lexical and structural perfective aspects in OHG: Co-construction of (Im) 
perfectivel,2 with Genitive/Accusative determines referentiality on the 
object-NP (+definite vs. —definite) 


V-Comp/I 
DATIVE-goal 
-—> ACCUSATIVE- Theme 


+» GENITIVE-source 


Sa 


Directional P-ACCUSATIVE 


DAs 


(Im)Perfective2 


aa? 
ieee 


(Im)Perfectivel 


(15) OHG (Im)Perfectives in transition to tensed structuring: After the demise 
of aspect oppositions, paradigmatic case change had no impact on refer- 
ence decisions. Verbal Genitive valence became subject to near-total de- 
mise in Modern German. 


V-Comp/I 
-—> (analytic) past-future TenseP 
|}_______» (analytic) present TP 
ACCUSATIVE-Theme 
GENEFHE -Seurce/Partitive 
Directional P-ACCUSATIVE 


Gat 


(Im)Perfective 


— 
PO 


(Im)Perfective 
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The hierarchical distinction between present and past tense(s) in (15) is based on 
markedness differences: the present is less marked than past and future, not to 
mention pluperfect and future-in-the-perfect as well. 

This concludes the discussion of the emergence of the analytic/periphrastic 
past in German from a totally tenseless, aspect-based source in OHG. We shall now 
turn to Polish, which has never had any analytic past tense in its recent past. Yet, 
there is indication that Polish just might complete what are the beginnings of such 
an analytic past. The signals indicative of such an initial process of grammaticaliza- 
tion are similar to those of OHG leading to past tense in Modern German. 


3. Signs of a newly emerging analytic past in spoken Polish 


‘The past has a synthetic form in Modern Polish: see byt - byta - byto in (14). 


(16) To byt nasz nauczyciel - byla nasza nauczycielka - bylo nasze pudelko 
this/it was our.M teacher.m ourF lady-teacher our.N box.N 


What seems like the finite function of the past copula in (16) is in fact an anterior 
participle with predicative adjectival status: It agrees in gender with the subject 
nominal as is the case for most Slavic languages. Clearly, this agreeing form hides 
an earlier periphrastic past which lost its auxiliary-copula component - quite in 
line with other copula constructions. Formally this phenomenon covers what is 
the adjectival passive albeit without any agreement in Modern German. It corre- 
sponds to the second last stage in the German grammaticalization path in (5) 
above, still not a tensing stage in the development. 

Simultaneously, however, spoken modern Polish is in the process of unfolding, 
before our eyes so to speak, a novel analytic past with the auxiliary mie¢ “have”. The 
construal is analytic like the German past periphrasis: mie¢c have’ + adjectival pas- 
sive participle (+ direct object) as illustrated by (17) (gleaned from Piskorz 2011). 


(17) Masz napisane wypracowanie? 
have. PRES.2.PS.SG written.PART.N.SG essay.ACC.N.SG 
“Have you finished writing the essay?” 


Although it is commonly assumed that none of the Slavic languages has an ana- 
lytic past, the pattern in (17) nevertheless shows up with some regularity. Accord- 
ing to a variety of authors (Koronczewski 1986: 277ff.; Giger 2003.), it occurs in 
Macedonian, Serbo-Croatian, Slovenian, Czech, Slovakian, Bulgarian, and in the 
North-Russian dialects. However, the form is not documented in grammar writ- 
ing (with the exception of Katny 1999a,b and Bartnicka 2004, both for Polish). It is 
banned from the written standards. It use is restricted to private letter writing, 
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diaries, and dialogue sequences, all linguistic genres that show close affinity to oral 
usage (Zeman 2010, Piskorz 2011). Clearly, there is a close relationship to the 
genre classification of the early appearance of the analytic past in German (cf. the 
‘South German preterit demise’ which emerged exclusively in regiolects and dia- 
lects of the overwhelmingly rustic South German areas; cf. Lindgren 1957; 
Abraham & Conradie 2001). 

According to the traditional grammars, written Standard Polish uses the fol- 
lowing four tenses: Present, Preterit/simple Past, Future, and Pluperfect; the latter 
counts as archaic, and it is used infrequently. The main and frequently used 
(simple) past is the original analytic past participle, which is inherited from Proto- 
Slavic and from which the copula component has been grammaticalized away. 
Cf. (16) above. However, Piskorz 2011 shows that spoken Modern Polish has a 
more complex paradigmatic structure which distinguishes the following analytic 
construals using the verb mie¢ ‘have’ + passive participle of the full verb, called the 
‘new miec + Participle Passive-Perfect group’ (Piskorz 2011). This paradigmatic 
group consists of the following 3 construal types: see (18). 


(18) a. miec-past: Mam zdane egzaminy 
(I) have taken (the) exams 
b. miec-pluperfect: Mialem zdane egzaminy 
(I) had taken (the) exams 
c. miec-future in the past: Bede mie¢ zdane egzaminy/ 
(1) will have taken (the) exams 
Bede mial zdane egzaminy 
(I) will had taken (the) exams 


“(1) would have taken” 


Let us conclude from (18) that miec is the finite component for past tense constru- 
als of the types recorded in (18a-c). The past obviously emerges from the anterior 
participle (zdane in (18) above). Since bede “be” is not directly participating in the 
encoding of the past, and since Polish has no regular alternation between be and 
have in lexical dependence of the type of verb (as German and Dutch do), the 
mie¢-HAVE construal cannot be on its grammaticalizing way toward anything but 
inducing the active past use of the anterior participle. 


4. Grammaticalizing into the new analytic active past in Modern Polish 
According to the latest stages of grammar writing (Piskorz 2011), the complex 


grammatical form miec + Anterior Participle is seen either as the seed of a new 
analytical form which could grow out as a new tense (as advocated by Nitsch 
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1913: 102; Pisarkowa 1964: 233ff; Vasilev 1968; Weydt & Kazmierczak 1999: 13ff.; 
Piskorz 2011), or it has simply retained its analytic resultative state, i.e., it is no 
more than a variant of the adjectival passive by¢ “be” + Passive participle (cf. Weiss 
1977; Maslov 1988; Koronczewski 1987; Lazinski 2001; Giger 2003; Katny 2010). 
It was first described for dialects such as Pomeranian and Kashubian and then 
continuously observed in all of oral Polish and genres of near-oral speech. Argu- 
ments in favor of the tense solution are the growing absence of agreement marking 
on the participle, the fact that objects no longer occur in this construal, and that 
the subject adopts clear Agent status. 

Piskorz (2011), working with query lists and large corpora of spoken regiolect 
and near-spoken genres, comes to the following conclusion: The new analytic mieé 
-construal still has, to all distributional appearances, features of both past tense 
and passive state. What makes discrete decisions tougher is also the fact that many 
of the illustrative patterns are in different states of grammaticalization dependent 
on the generation of the speaker. See the differentiation of possible readings in 
(18a-c) above. 

Let us briefly go through each of the three options and base the conclusions on 
sound examples. Note principally that mieé always may retain its possessive se- 
mantics. The subject denotes the possessor, while the object stands for the pos- 
sessed object. 


(19) Mam _ popsuty samochdd. 
(I) have (a) broken car 


Option (18a) is related to the full verb usage in (19). The resultative mie¢-Past is 
characterized by the fact that the external argument, the clausal subject, shares its 
agentivity with the Agent of the previous clause and its eventive property. This 
makes the event active as in (18): 


(20) Ania ma juz zdane wszystkie egzaminy. 
Ania has already passed all examinations 


In the resultative miec-passive reading the clausal subject is not an Agent, but a 
Beneficient; the mie¢ + passive participle construal has features of an intransitive as 
in (19). The miec-passive occurs most frequently with three- place verbs. See (19). 


(21) Odlat mam obiecywang podwyzke. 
for years (I) have/got promised __ pay raise 
“For years, they promised me a pay raise.” 


The critical question appears to be whether the restrictions on the formation of the 
mieé-Past have to do with Polish aspect and to what extent this controls its selec- 
tion criteria of the temporal reading. In this respect, there is a strong convergence 
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to be identified with the emergence of the analytic past in Old High German. 
Compare, first, (22) for the grammaticalization chain in OHG repeating partly (5) 
above and extending it by specifying the four different grammaticalization steps in 
(23a—d) below. 


(22) transitive Perfective/Resultative Present — transitive Anterior/Past > in- 
transitive Past > analytic Past 


(23) a. two-phase secondary predication (small clause) + haben/possess (x,y): 
te<tr 
b. one-phase + haben/Aux(x,y): te<tr/ts — “Anterior/Past” 
c. one-phase + sein/Cop(x): te<tr 
d. one-phase + sein/Aux(x): te<ts 


The convergence with Polish starts with the first step, (23a), the past passive in 
transitive construals to be formed exclusively from perfective verbs. See (24). 


(24) The new Polish construal forms prevailingly with perfective transitive 
verbs. This corroborates the two-phased character of the denoted event as 
in (23a) above: ‘The first eventive phase is represented by mie¢ in present 
tense, while the second phase bears the eventive character referred to by 
the Anterior participle of the perfective predicate. See (25) with the main 
perfective predicate and its participial form zdane. 


(25) Ania ma juz zdane wszystkie egzaminy. 
Ania has already passed all exams 


The use of mie¢ in (25) means that Ania, at the time of the speech act, ts, is in the 
possession of reports testifying the content of zdane (wszystkie egzaminy) that the 
exams have been passed, te<tr. In other words, this is still far away from the active 
tensed construal. 


5. Signals testifying to the new development of analytic tensing 


However, there are signals that the grammaticalization has reached further steps 
on the scale of (23): ie. that it opens for the option in (23b), mie¢ + Anterior Par- 
ticiple Passive, and even further to embrace perfective intransitives. See (26) for 
the iterative perfective solution (the following illustrations are due to Piskorz 
2011). 


(26) Naszasasiadka ma ciagle remontowane mieszkanie. 
our neighbor has repeatedly brushed up _ her appartment 
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Nevertheless, some of the original restrictions are maintained since mieé + An- 
terior participle excludes linking with imperfective intransitive verbs, psych-verbs 
and verbs of motion, as Piskorz (2011) emphasizes. Full reanalyzed examples are 
still in the minority: 76% of the total evidence for mie¢ + Anterior/Past participle 
shows agreement between the transitive object and the adjectival participle. Sub- 
ject agreement in the nominative is still relatively scarce. Note that if the participle 
were applied post-attributively to nic “nothing” the partitive genitive would be 
required — but it is not as (27) testifies. This is indicative of the loss of adjectival 
status of the Anterior/Past participle and its collapse with mamy as well as the 
recategorization as auxiliary of the latter. 


(27) Niemamy nic zaplanowane. 
not (we) have nothing planned.ANT-PART.NEUT.NOM 
“We have not planned anything.” 


The fact that the Anterior participle is in neuter declension signals that the object 
is no longer part of secondary predication and that the participle is not in attribu- 
tive relation to the object. This may be linked to the fact that mieé has already the 
status of Aux. 


(28) Masz juz posprzatane? 
(you) have already cleaned up.ANT-PART.NEUT.NOM 
“Have you cleaned up already?” 


Furthermore, complement infinitivals after the miec-Participle are clear indica- 
tions of step (23d) in Polish. Notice that, clearly, mieé must have lost its original 
meaning as lexical “possess”. 


(29) Ania ma zaplanowane przyjechac latem. 
Ania has.3sG.pREs planned.ANT-PART.NEUT.ACC (to) come in summer 
“Ania has planned to come in the summer.” 


The above is in line with the appearance of the mie¢-Anterior in construction 
with modal verbs as in (30). 


(30) Do jutra musze mie¢ zaplacony czynsz. 
till tomorrow (I) must have paid. ANT-PART.SG.M rent.ACC 
“T must have paid my rent (up) until tomorrow.’ 


The analytic miec-Pluperfect is illustrated in (31), while the analytic mieé- Fu- 
ture-in-the-past is shown in (32) (from Piskorz 2011). 


(31) Zanim rozpoczelam prace w banku, 
before (I) begin.pAsT.PERF work.acc.sG.F in bank 
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miatam juz ukonczone studia. 

(I) had already completed. ANT-PART.M.PL Studium.Acc.PL 

“T had already completed my studies before I began working in the bank” 
(32) Odwiedze cie, gdy bede mie¢ 

(I) visit-PasT you when (I) be.FuT have.INF.IMPERF 

zdane wszystkie egzaminy. 

passed. ANT-PART.PL all exams.ACC.PL.M 

“Once I have passed all exams I will visit you.” 


Each of the above illustrations testifies to the fact that (i) mie¢ is on its way from 
lexical to auxiliary verb, (ii) the Anterior participle is gradually losing its predica- 
tive adjectival status, and that (iii) the new analytic verbal form is growing to- 
gether as a singular predicative function, but has not yet reached the state as the 
analytic past in German. 


6. Conclusion - summary 


What we were really interested in is whether grammaticalizing changes, once they 
start from similar points of departure towards analytic forms, follow a common 
grammaticalizing path. We have witnessed a far-ranging convergence of the emer- 
gence of analytic pasts in the history of German and modern spoken Polish. To be 
sure, the Polish development has not reached the terminative state of the German 
grammaticalizing path. However, the developmental stages of modern spoken 
Polish covering the major part on the grammaticalizing path of the German ana- 
lytic past warrant the expectation that the emergence of an analytic past tense will 
be completed in the future in the same fashion. 

The intriguing factors about this novel development in Polish are the follow- 
ing ones. First, is it really so that Polish is giving up the aspectual paradigm it has 
fared so well with so far, and if really so, why? Second, the process of aspectual 
loss in OHG was accompanied by the loss of the genitive as a structural case 
(Abraham 1997; Leiss 2000: 175ff.). Was this a necessary outgrowth of the demise 
of aspect? Is the genitive in Polish not structural (is it, e.g., only in partitive func- 
tion, as in (21) after negative operators) such that the weakening of aspect will not 
necessarily have an impact on the case paradigm in Polish? Third, there are devel- 
opments towards analytic pasts in other West Slavic languages (Koronczewski 
1986, Giger 2003). What speaks against the assumption that this is due to contact 
influence from German is the long temporal caesura between the German and 
the Slavic development: Why did such contact not exercise its impact much 
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earlier? Based on the temporal distance between the similar developments, we 
draw the conclusion that contact has no role to play. However, this brings us to a 
cross-linguistic conclusion: The emergence of analytic predicatives appear to fol- 
low one common line of grammaticalization as long as they start from the same 
aspectual and auxiliarizing premises. Note, in this context, that the start in 
German was different from that of Polish in that German began with two Aux 
candidates: one for BE for imperfective intransitives and another one for HAVE 
selected by transitives. Polish only feeds HAVE into the new analytic complex. The 
grammaticalizing distance the new analytic past in Polish has to be covered is far 
larger in that the transition from have to what needs to be BE with intransitives 
requires far more fundamental reanalysis. 

The new Polish analytic past is still considerably constrained by aspectual con- 
tingencies. It possesses characteristics that historical German shared in its devel- 
opmental phase with the resultative participle. In other words, Polish is still at a 
distance from having an analytic temporal. The constraining factors are that the 
new form appears with perfective transitive verbs in the majority of the evidenced 
cases, the preferred status of gender and case agreement being between the transi- 
tive object and the Anterior participle, and the resultative character of the parti- 
ciple. This is indicative of secondary predication - which is very clearly an asset of 
aspectual status. Furthermore, the only verb capable of becoming finite in this 
development has by far not always reached auxiliary status. The most promising 
signal for the outgrowth of analytic tense is that the complex form is opening to 
the classes of intransitives and imperfective verbs. The growing loss of agreement 
on the participle as well as new Agent subjects (notice that lexically full mie¢ “pos- 
sess” does not select an Agent) speak in favor of emergent tensing. The fact that a 
new pluperfect and future-in-the-past emerge on the identical analytic formation 
strengthens the expectation that the entire temporal paradigm is in flux toward a 
radical reanalysis in Modern Polish. 

Whatever the general, cross-linguistic tenability of our grammaticalizing path 
sketched for both OHG and Polish, Faarlund’s (2008) characterization of gram- 
maticalization holds, namely that it is not a theory in the narrow sense (see already 
Abraham (2004)), although the described phenomena are all to the point. In par- 
ticular, each of the grammaticalizing steps that are valid for the two language states 
conform to the unidirectionality principle: It is unimaginable to reverse the gram- 
maticalizing process from a synthetic stage to an analytic stage - notably without 
any change in function or meaning (cf. Askedal 2008). 
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This paper provides a description and account of some of the changes involving 
the DP, namely anaphoric marking in English (and Scandinavian). I argue that 
Old English personal pronouns are not deictic/referential but that demonstrative 
pronouns have this function. This situation reverses itself in early Middle 
English, due to both internal and external factors. The internal factors are 

the shift towards the use of demonstratives before noun, i.e. the introduction 

of an article; the external factors are language contact that introduces new 
personal pronouns. I also look at Old Norse where the use of pronouns and 
demonstratives is similar to that in Old English. This remains the case to a lesser 
degree in its modern descendants. 


‘The structure of the DP is a very popular topic in Scandinavian linguistics. Having 
both a prenominal and postnominal ‘article’ makes Scandinavian unique among 
its Germanic relatives. The rich variation across Scandinavian languages has fu- 
eled much synchronic research. Faarlund (e.g. 2004, 2008) has added substantially 
to the debate on the structure of the Scandinavian DP but especially to its develop- 
ment through history. Various people (Abraham & Leiss 2007, Lohndal 2007, van 
Gelderen 2007) have responded to this work suggesting modifications. 

In this paper, I will build on the changes described for the DP in Scandinavian 
and English and see what happens to the anaphoric ‘system’ when the demonstra- 
tive is reanalyzed as an article and how that can be phrased in Minimalist terms. I 
use a Minimalist Feature Economy approach (van Gelderen 2008; 2011) to ac- 
count for the changes in demonstratives but the paper can be read without assum- 
ing such an approach. It may look as if the increase in features of the personal 
pronouns would go against feature economy but I argue that the shift towards a 
special feminine pronoun and a plural one in the history of English were caused 
externally, through Scandinavian influence. The shift is therefore not one where 
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one lexical item gains semantic features but one where a lexical item is replaced by 
another. The third person pronoun loses its reflexive character at the same time. 
These changes then enable the third person pronouns to shift topic and be more 
deictic. 

‘The system in Old Norse is very similar to that of Old English in that the third 
person paradigm is ‘mixed’ with the demonstratives. The pragmatic rules on when 
to use a pronoun or a demonstrative are similar as well. These rules are fluid as an 
example from Dutch, which frequently uses demonstratives in lieu of personal 
pronouns, shows. Modern Scandinavian is in between Old Norse and Dutch, with 
a less frequent use of demonstratives. 

The outline is as follows. In Section 1, I provide some background on the 
structure and function of the DP. In Section 2, I briefly describe what changes oc- 
cur in the English DP, based on van Gelderen (2011, 2013), and then go into the 
function of demonstratives and pronouns. Section 3 provides a preliminary look 
at the function of demonstratives and pronouns in other Germanic languages, 
e.g. Dutch, the Old Norse Poetic Edda, and Modern Norwegian. For practical pur- 
poses, I will restrict myself to demonstratives and pronouns that refer to animate 
antecedents (but note that s/he used to refer to inanimate nouns as well) and also 
to those pronouns used independently, i.e. without a noun following. 


1. Demonstratives and pronouns as DPs 


Cross-linguistically, if both a demonstrative (DEM) and article (ART) appear, 
the order is [DEM ART N] or [ART N DEM], according to Rijkhof’s 
(2002: 179-180) list of languages. Using a DP, as in (1), the [DEM ART N], at- 
tested in e.g. Abkhaz, Guarini, and Hungarian, would be the base order with the 
specifier in initial position and [ART N DEM], attested in e.g. Berbice Dutch 
Creole and Galela, would be a specifier-last structure. (It is of course also possible 
to use a right-branching structure only with (1a) as a base and left-ward move- 
ment of the article and the noun). 


(1) a. DP b. DP 
ee Aaiee Oe ae 
DEM D’ D’ DEM 
a EO. 
ART NP ART NP 


N N 
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Since the mid 1980s, this DP has served as the structure for the nominal (e.g. Abney 
1987), with the demonstrative in the Specifier of the DP, as in (2a), and the article 
in the head of the DP, as in (2b). 


(2) a. DP b. DP 
that D’ D’ 
NP D NP 
problem the problem 


‘There are restrictions on the co-occurrence of definite markers. Thus, in English, 
either the specifier or head can be present but not both, as (3a) shows, and that, but 
not the, appears independently, as (3bc) shows. 


(3) a. *That the dog loves their the toys. 
b. I saw that. 
c. *I saw the. 


Although the complementary distribution of the English demonstrative and arti- 
cle, shown in (3a), might lead one to think that they both occupy the same position 
in (2), the differences shown between e.g. (3b) and (3c) make that unlikely. The 
structure in (2) can be expanded with agreement and Case features, e.g. through a 
Num(ber), Phi, or a Kase Phrase, and I will do so below where necessary. 

Pronouns are also DPs but their exact structure is debated (see e.g. Cardinaletti 
& Starke 1996, Déchaine & Wiltschko 2002, and van Gelderen 2011). I will argue 
that first and second person pronouns consist of pure phi-features (person and 
number) whereas third person pronouns have more (deictic) features. I also argue 
that third person pronouns vary cross-linguistically in their features. They can 
include gender and can be +/- referential (deictic), as in (4). 


(4) a. PhiP b. DP 
Phi NP D PhiP 
I s/he ge es 
sthe NP 


In the analysis provided below, I depend on the features of demonstratives and 
pronouns and therefore quickly review what is assumed for Modern English. As- 
suming a minimalist analysis (e.g. Chomsky 2007), articles are clearly probes lo- 
cated in D, as in (5a), with uninterpretable features probing the phi-features of the 
noun. Since the has [u-phi], it cannot occur on its own, as shown in (3c). The de- 
monstrative can occur on its own, as in (3b), and I therefore assume it has inter- 
pretable person features ([i-ps]), as in (5b), or interpretable person and deictic 
features ([i-loc]), as shown in (5c). Number in (5bc) has to be checked, as I have 
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indicated,' but the exact probe on the demonstrative still has to be determined. 
That probe might be [u-#] in the NumP. 


(5) a. DP b. c. ee 
D NumP that NumP that NumP 
the Baer [i-ps] he es [i-loc] aan 
Num NP Num NP _ [i-ps] Num NP 
[u-phi] li-ps/#] [u-#] [i-ps/#] [u-#] [i-ps/#] 
[u-#] 


‘The phi-features of pronouns in (4) are interpretable for first and second but vari- 
able for third, as we'll see below. 

The function of the pronouns, (articles,) and demonstratives depends on the 
features. I show that Old English personal pronouns are typically used anaphori- 
cally to topical antecedents; demonstrative pronouns can shift topic and are refer- 
ential (deictic). 


2. Old English to Middle and to Modern English 


In this section, I show that Old English uses demonstratives widely: (a) optionally 
before nouns, (b) independently to refer to nouns, and (c) as relative pronouns. 
Personal pronouns are used (a) to refer to previous nouns and (b) as reflexives. 
Traugott (1992) and Kiparsky (2002) discuss this choice between personal and 
demonstrative pronoun in terms of the different discourse functions these ele- 
ments have. In late Old English, the relationship between demonstrative and per- 
sonal pronouns shifts. 


2.1 Old English 


Old English uses demonstratives to accompany nouns as in (6), as well as indepen- 
dently, as in (7). Note the use of he in (7) to refer to se. A third use is that of a rela- 
tive pronoun, as in (8). 


(6) hu 6a zbelingas ellen fremedon 
how DEM.NOM.P nobles-Nom.p courage did 
‘how those nobles performed heroic acts? (Beowulf 3) 


(7) X. Huelc se beonsceal de to reccenddome cuman sceal. 
which he be _ shall who to power come | shall 
“What kind of man he shall be who is to rule?’ 


1. Diessel (1999: 25) provides percentages of inflection on demonstratives in 68 languages 
that have inflection and shows that number marking is the most frequent, followed by gender 
and Case. 
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Table 1. Demonstratives in Old English 


Masculine Feminine Neuter Plural 
Nom se seo pet pa 
Gen pees peere pees para 
Dat pem peere pem pem 
Acc pone pa peet pa 


XI. Huelc se beon sceal se Ozrto cuman ne sceal. 
which he be © shall who thereto come not shall 
“What kind of man he shall be who is not to rule?’ 


XII Hu se se de  gedafenlice & endebyrdlice to cymé, 
how he who that properly and regularly to comes 
hu he deron drohtian scyle. 
how he thereon conduct should 
‘How he who properly and regularly attains thereto should conduct 
himself therein’ 
(Alfred, Pastoral Care, Hatton, 11. 18-21, Sweet edition) 


(8) AEghweepres sceal scearp scyldwiga gescad _—— witan worda 
every shall sharp shield-fighter difference know words 
ond worca se be wel bencep 
and works he that well thinks/judges 
‘Every sharp shield fighter, who judges well, must know the difference be- 
tween words and works: (Beowulf 287-9, Fulk et al’s edition) 


The (distal) demonstrative forms used are provided in Table 17; they are the same 
whether they are used pronominally or independently. 

‘The personal pronouns are used for reference to topical elements. For instance, 
the passage in (9) is about Scyld Scefing, the mythical ancestor of Danes, and he 
and him are used. A demonstrative pet is also used for extra emphasis or because 
it is an identificational demonstrative (according to Diessel 1999). 


(9) Oft Scyld Scefing sceabena preatum monegum megbum 
often Scyld Scefing shadows crowd many family 


meodosetla _ofteah egsode eorlas syd0an erest 
mead-benches away-took scared brave-men since _ early 
wear0 feasceaft funden he pees frofre gebad 


became poor found _he that consolation waited 


2. I will not discuss proximal demonstratives. 


200 Elly van Gelderen 


weox under wolcnum weordmyndum bah odpet 

grew underclouds honor grew until 

him eghwylc bara ymbsittendra ofer hronrade 

him every that sitting-around across sea 

hyran scolde | gomban gyldan bet wes god _ cyning 

obey should| tribute pay that was good king 

‘Scyld Scefing often drove away troops from the mead-halls and terrified 

many. He had once been a foundling but he found consolation and grew 

in glory until everyone had to pay tribute to him. That was a good king’ 
(Beowulf 4-11) 


Old English personal pronouns differ from their modern counterparts in that they 
can function as reflexives, as in (10) and (11), indicating a lack of referential or 
deictic properties. 


(10) No ic me an hereweesmun hnagran talige 
not I me in prowess smaller consider 
gubgeweorca, ponne Grendel hine 
war-deeds than Grendel himself 
‘By no means do I consider myself smaller in prowess and war-deeds than 


Grendel does himself. (Beowulf, 677-8) 
(11) him bebeorgan ne con 

him hide not can 

‘He could not hide himself? (Beowulf 1746) 


The introduction of self occurs in late Old English and is an internal change that 
starts with third person and then spreads to first and second person. In Northern 
Middle English texts such as the Ormulum it is frequent in all persons. 

Pro-drop, as in (11) and (12), also occurs in Old English, but for more on that 
see van Gelderen (2000; 2013). (12) is taken from a passage where the narrator 
talks about monsters stalking in the sea. 


(12) bet sydpanna ymb brotne ford brimlidende lade ne letton 
that since never on broad waterseafarers passage not let 
‘that they after that never kept people from passing that water’ 
(Beowulf 567-9) 


Pintzuk (1996) claims that pronouns are clitics and this would fit with reduced 
referentiality. The pronominal forms are found in Table 2, with the dual increas- 
ingly rare towards later Old English and the third person forms very phonologi- 
cally variable. 
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Table 2. Old English personal pronouns 


S Dual P 
First NOM ic wit we 
GEN min uncer ure 
DAT me unc us 
ACC mec uncet usic 
Second NOM pu git ge 
GEN pin incer eower 
DAT pe inc eow 
ACC pec incit eowic 
Third NOM he/heo/hit - hi 
(M/F/N) GEN his/hire/his - hira 
DAT him/hire/him ~ him 
ACC hine/heo/hit - hi 


As just mentioned, demonstratives and pronouns could both be used anaphori- 
cally. Traugott (1992) accounts for the choice between a pronoun and a demon- 
strative in terms of information structure, as does Kiparsky (2002). Kiparsky 
argues that Old English personal pronouns are not used deictically, do not head 
restrictive relative clauses, and are not used as predicates. Pronouns, according to 
him and Traugott (1992: 171), express the discourse topic whereas demonstratives 
indicate a change of topic. They both quote (13) as evidence for the different roles 
of the demonstrative and personal pronouns. ‘The first two pronouns hi and him 
continue previous topics but the demonstrative se changes the topic to the angel. 


(13) Hi habbad mid him awyriedne engel 
They have withthem corrupt angel 
mancynnes feond and se heefd andweald on... 
mankind’s enemy, and he [the angel] has power over... 
“They have with them a corrupt angel, the enemy of mankind, and he has 


power over .... (Alfric, Homilies ii.488.14, from Traugott 1992: 171) 


This complementary function can also be seen in (7) above and, for the most part, 
in (14) and (15), where I have not given a word-by-word gloss but have bolded the 
antecedents, the demonstrative se pronouns, and personal he/him pronouns. 


(14) pet fram ham gefreegn Higelaces begn, god mid Geatum, Grendles deeda 
‘Hygelac’s thane heard about Grendel's deeds while in Geatland’ 
se wees moncynnes megenes strengest on bem deege pysses lifes 
‘he (=Hygelac’s thane) was mankind’s strongest man on earth’ 
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epele ond eacen. Het him ydlidan godne gegyrwan, cwed, 

‘noble and powerful. (He) ordered himself a good boat prepared and 
said’ 

he gudcyning ofer swanrade secean wolde, meerne peoden, 

‘that he wanted to seek the king over the sea’ 

ba him wes manna pearf. done sidfeet him snotere ceorlas lythwon logon, 
‘since he (=the king) needed men. Wise men didn't stop him (=Hygelac’s 
thane)’ 

beah he him leof were. 

‘though he was dear to them? (Beowulf 194-98) 


In (14), the first demonstrative se (line 2) can be used because the antecedent 
(Hygelac’s thane) is no longer clear (Grendel is a possible antecedent) or for em- 
phatic reasons. A subject is left out before het because the topic is the same and 
repeated through the personal pronoun he. The topic shifts to the gudcyning which 
is then referred to as he. After this, the reference of him and he shifts back to Hyge- 
lac’s thane without the use of a demonstrative. 

In passage (15), from a later work, each antecedent is introduced before it is 
referred to by a personal pronoun, as expected. 


(15) pis eerendgewrit Agustinus ofer sealtne sce sudan brohte iegbuendum, 
“This message, Augustine brought over the salty sea from the south to the 
islanders’ 
swa hit er fore adihtode dryhtnes cempa, Rome papa. 
‘as the Lord’s champion before had decreed it, Rome's pope’ 
Ryhtspell monig Gregorius gleawmod gindwod 6uth sefan snyttro, 
“The wise Gregorius was versed in many true doctrines through the wis- 
dom of his mind’ 
searodonca hord. Fordceem he monncynnes meest gestriende 
‘his hoard of cunning thought. Because he gained over most of mankind, 
rodra wearde, Romwara betest, monna modwelegost, meerdum gefregost. 
‘the guardian of heaven, Rome’ best, wisest of men, most gloriously 
famous’ 
Siddan min on englisc eelfred kyning awende worda gehwelc, 
‘Afterwards King Alfred translated every word of mine into English’ 
and me his writerum sende sud and nord, heht him swelcra ma brengan 
‘and he sent me to his scribes south and north. He ordered more to be 
brought to hin’ 
bi dere bisene, dcet he his biscepum sendan meahte, fordcem hi his sume 
dorfton, 
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‘after that example that he might send these to the bishops because they 
needed it’ 
da e laedenspreece lceste cudon. 
‘who knew little Latin? 
(Alfred, Pastoral Care, Hatton, 9. 8-16, Sweet edition) 


The reasons behind the distribution of the pronoun and demonstrative are not 
well understood (a similar indeterminacy involves explaining exactly when pro- 
drop appears in e.g. Spanish). In Old English, demonstratives don’t always change 
the topic and pronouns don't always continue the topic. Thus, (a) the demonstra- 
tive in (14) above and the second se in (16) are emphatic but dont change the 
topic, (b) the pronouns he in (14) and (16) shift the topic, and (c) the number of 
pronouns is much higher than that of demonstratives. 


(16) Da et nehstan se cyning, se de Seaxna gereorde an cude, 
“Then the king, who knew only Saxor’ 
wes adroten his elreordre sprece: aspon pa in Westseaxe 
‘was tired of his (=the bishop’s) foreign speech (and) invited into Wessex’ 
oderne biscop, se de his gereorde cude: 
‘another bishop who knew his (=the king’s) language’ 
se wees Wine haten & se wes in Gallia rice gehalgod 
‘He (=the new bishop) was wine called and he was in Gaul consecrated’ 
Ond he pa todelde in twa biscopscire Westseaxna megoe, 
‘And he (=the king) then divided Wessex into two bishoprics’ 
ond beem Wine gesealde biscopsedl in Wintaceastre. 
‘and he gave the bishopric in Winchester to Wine’ 
(Bede, Miller edition, 186-70) 


I will not attempt a refinement of the distribution of the two types of pronouns but 
will focus on the changes starting in late Old English and account for those. 


2.2 Late old and early Middle English 


In this section, I first outline the changes (in 2.2.1) that are happening to the 
pronominal system and then provide a case study of these changes in one text 
(in 2.2.2). 


2.2.1 The changes 
In late Old English, the distal demonstrative pronoun se is reanalyzed as the defi- 
nite article. The first clear articles based on the demonstrative appear in the middle 


204 Elly van Gelderen 


of the 12th century’. The evidence for the reanalysis is first that se and its variants 
need to be followed by a nominal (Wood 2003: 69) whereas beforehand this form 
had been optionally followed by a noun and secondly most nouns are now pre- 
ceded by a demonstrative or article whereas earlier they appear on their own. 
Good examples of articles come from the Peterborough Chronicle and the slightly 
later Ormulum, as in (17). 


(17) & gaddre-sst swa pe clene corn Allfra pe chaff togeddre 
and gather-2s so the clean wheat all from the chaff together 
‘and so you gather the clear wheat from the chaff? 
(Ormulum 1484-5, Holt edition) 


Other shifts in the pronominal system occur. The Old English third person pro- 
nouns show an initial h- (see Table 2), but are very variable (hi can be singular and 
plural, etc.). In the late Old English period, a new third person feminine singular 
pronoun she and third person plural they appear. This change occurs in the middle 
of the 12th century, in fact in the same texts as the articles first appear. 

The origin of the feminine she is controversial. One theory is that it derives 
from the demonstrative seo/sio and another that it first appears in areas with Norse 
contact. According to the OED, 


the districts in which she or sho first appears in the place of heo are marked by the 
abundance of Scandinavian elements in the dialect and place-names; and in Old 
Norse the dem. pron. (of all genders) is often used as a personal pron. It is also 
noteworthy that in OS. and OHG. the fem. pers. pron. nom. sing. was siu (mod.G. 
sie, Du. zij), corresponding to OE. sio (the oblique cases, and the masc. and neut. 
in the sing., being f. the stems hi-, i-); and in OFris. se ‘she’ occurs beside hiu. 


In the Peterborough Chronicle, females are referred to infrequently by heo, seo, and 
sce. Between the years 1090 and 1140, for instance, I found one demonstrative seo 
used for ‘she; as in (18). After that, the form changes to sce, as in (19), which the 
OED cites as the first instance of modern she. 


(18) pcereefter toforan Candelmessan on Windlesoran him to wife forgyfen Adelis 
e& syddan to cwene gehalgod. seo wes bes heretogan dohtor of Luuaine. 
‘thereafter before Candlemas at Windsor was given him Adela as wife and 
after (she was) hallowed queen. She was the duke of Louvain’s daughter: 

(PC 1121, Thorpe edition) 


(19) And te Lundenissce folc hire wolde teecen & sce fleh. 
‘And the people of London wanted to take her, and she flew’ (PC 1140) 


3. Although Christophersen (1939: 86-7) claims it was much earlier. 
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(20) He brohte his wif to Engleland. & dide hire in pe castel on Canteberi. God 
wimman scee wees. oc scee hedde litel blisse mid him. & Crist ne wolde dcet 
he sculde lange rixan. ... & his sune Henri toc to pe rice. & te cuen of France 
todeelde fra be king. & scee com to pe iunge eorl Henree;. & he toc hire to 
wiue. 

‘He brought his wife to England and put her in a castle in Canterbury. She 
was a good woman but she had little bliss with him. And Christ didn’t 
want him to have a long rule ... and his son Henry took the throne and the 
queen of France parted from the king and she came to the young count 
Henry and he took her to wife: (PC 1140) 


In the Ormulum, from 1200, females are referred to more frequently and always 
through 3ho, as in (21). 


(21) 3ho wass ... Elysabeep 3ehatenn 
‘She was called Elisabeth’ (Ormulum 115) 


The introduction of they is, according to the OED and most handbooks, due to 
Scandinavian influence. This fits the time period and texts in which it was first 
introduced. 


(22) & swa pe33 leddenn heore lif Till batt te33 weerenn alde 
‘and so they led their lives until they were old? (Ormulum 125-6) 


From the late Old English through the Middle English period, special reflexives 
are also introduced (see van Gelderen 2000) and the Ormulum uses them fre- 
quently (though the Peterborough Chronicle does this much less). I see this as extra 
evidence for the anaphoric strengthening of the personal pronoun. 

Personal pronouns also start being used demonstratively, as the OED puts it, 
in (23). 


(23) For he and he had samen ben, forwit selcuth wrath. 
‘Because they had together been debating’ 
(Cursor Mundi, Cotton 16161, Morris edition) 


Thus, the demonstratives and pronouns are no longer helping dissolve pragmatic 
ambiguity. Up to the current period the excessive use of personal pronouns leads 
to ambiguous reference, although there is an alternative. (The) latter is first used in 
the 11th century to mean ‘later’ in time, according to the OED, and (the) former is 
used in the 12th century to mean ‘earlier’ in time. The early Middle English (24) 
and (25) seem to me to be the earliest examples and this is a text we have encoun- 
tered earlier as using articles and she and they. 
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(24) Rihht swa summhe be forrme mann Ar _ oferrcumenn haffde 
just so as he the former man earlier overcome had 
(Ormulum 12494-5) 


(25) pin forrme win iss swipe god, bin lattre win iss bettre. 
‘Your earlier wine is very good, your later wine is better? 
(Ormulum 15409) 


So, the situation in Old English is one where pro-drop occurs, the personal pro- 
noun is used to refer to a continuing topic, and also reflexively, but not as a relative 
pronoun. The demonstrative, by contrast, is used as relative pronoun and (often) 
for topic-shifting. The changes that occur at the end of the Old English period are 
a reanalysis of the masculine singular demonstrative as article (through internal 
change), an introduction of new third person pronouns (through external change, 
ie. contact), and the introduction of the reflexive (through internal change). 
Figure 1 summarizes these. 

In other work (van Gelderen 2011), I have argued that grammaticalization, 
such as the reanalysis of a demonstrative as an article, involves a loss of interpre- 
table features. Without going into that in too much detail, I would like to suggest 
that the Old English demonstrative se was reanalyzed without deictic interpretable 
features, as in (a) of Table 3, and also without gender and number. (‘The [u-T] 
features represent the case features but nothing hinges on that). The changes in 
(b) are external, triggered by the new plural and feminine third person forms. 

In 2.2.1, I have provided a scenario for changes involving demonstratives and 
pronouns. In the next subsection, I present a study of demonstratives and pro- 
nouns in a transitional text. This will also show that the be in e.g. (17) is indeed an 
article. 


2.2.2 The Peterborough Chronicle 

The language of the Peterborough Chronicle is often seen as representing the change 
from Old to Middle English. The main change comes around the entry for the year 
1122, when the scribe starts adding new information (where before he was copy- 
ing from an earlier manuscript); a second change starts with the year 1132 when 


Internal External 

se -- >the seo -- > she 
that- - >that hi --> they 
him/her- - -» him/herself 


Figure 1. Changes in Late Old English 
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Table 3. Changes in terms of features 


Old English Middle English 
a. se > the 
[i-loc]/[i-phil] {u-T]/[u-ps] 
b.  he/hi is replaced by he 
heo/ha is replaced by she (possibly via seo) 
hi/hie is replaced by they* 
[i-phi] [i-phi]/[i-loc] 


the second scribe takes over. There are also some additions that the first scribe 
made throughout the text. I will examine an excerpt from the beginning of the 
Chronicle (the preface), i.e. without articles and with few demonstratives, an ex- 
cerpt from the changing part (the year 1130), i.e. with articles and demonstratives, 
and then for a year in the Final Continuation (the year 1137) with clear articles 
and also new pronouns (see (19) and (20)). 

In (26) and following, from the beginning of the Peterborough Chronicle, the 
pronouns are in bold and the translation is from the online medieval and classical 
library (http://omacl.org/Anglo/partl.html) so as not to be biased by my own 
translation on the use of articles etc. In (26), the first mention of Scottas has no 
demonstrative preceding it but the second does, and in (27), the Pihtas also get a 
demonstrative since they have been mentioned before, as does the land in (27). As 
for pronoun use in (26), there is a pro-drop to the topic of the passage and the first 
hi goes back to the topic. The next instances of hi show that they do not straight- 
forwardly refer back to the topic: the second hi switches the topic, the subsequent 
hi doesnt, and the last hi refers to both groups of people. 


(26) & bacoman erost on norp Ybernian up. & beer beedon Scottas 
‘and, landing first in the northern part of Ireland, (they =Picts) told the 
Scots’ 
beet hi ber moston wunian. Ac hi noldan heom lyfan. 
‘that they (=Picts) must dwell there. But they (=Scots) wouldn't give them 
leave’ 
forpan hi cweedon peet hi ne mihton ealle eetgedere gewunian peer. 
‘for the Scots told them that they (=Scots and Picts) could not all dwell 
there together; 
& ba cwedon pa Scottas. we eow magon beah hwadere red geleron. 
‘But, said the Scots, we can nevertheless give you advice: 


4. There are many other forms but I just give the most frequent ones. 
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(27) We witan oper egland her be easton. ber ge magon eardian gif ge willad. 
“We know another island here to the east. There you may dwell, if you 
will 
e& gif hwa eow widstent. we eow fultumiad. bet ge hit magon gegangan. 
‘and whosoever withstandeth you, we will assist you, that you may gain 
it 
Da ferdon pa Pihtas. & geferdon pis land norpanweard. 

“Then went the Picts and entered this land northward’ 
e& supanweard hit hefdon Brittas. swa we cer cwedon. 
‘and Southward the Britons possessed it, as we before said’ 


For the transitionary period (the First Continuation), I randomly selected the be- 
ginning of the entry of the year 1130. There is a real increase in demonstratives 
and these demonstratives are often phonologically lighter. Notice that all the nom- 
inals in (28) are preceded by a D-element; the names are not since they are them- 
selves definite. 


(28) Dis geares wes se mynstre of Cantwarabyri halgod fram pone cercebiscop 
“This year was the monastery of Canterbury consecrated by the Archbishop’ 
Willelm pes deies iiii Nonce MAI. Der weron pas biscopes. 

‘William, on the fourth day before the nones of May. There were the 
Bishops’ 

Tohan of Roueceastre. Gilbert Uniuersal of Lundene. Heanri of Winceastre. 
Alexander of Lincolne. Roger of Seresbyri. Simon of Wigorceastre. Roger of 
Couentre. Godefreith of Bathe. Eourard of Noruuic. Sigefrid of Cicaestre. Ber- 
nard of Sancti Dauid. Audoenus of Euereus of Normandige Iohan of Seis. 
‘John of Rochester,Gilbert Universal of London, Henry of Winchester, 
Alexander of Lincoln, Roger of Salisbury, Simon of Worcester, Roger of 
Coventry, Geoffry of Bath, Evrard of Norwich, Sigefrith of Chichester, 
Bernard of St. David’s, Owen of Evreux in Normandy, John of Sieyes: 


In (29), all arguments are preceded by a demonstrative except Sancti Andreas myn- 
stre ‘St. Andrews monastery’ but this is because Sancti Andreas functions as D. 
Inside PPs, there is typically no D, an indication that the demonstrative is used for 
structural Case. 


(29) Des feorde deeges beercefter wees se king Heanri on Roueceastre. 
‘On the fourth day after this was the King Henry in Rochester’ 
e& se burch forbernde elmeest. & se eercebiscop Willelm halgede 
‘when the town was almost consumed by fire; and the Archbishop William 
consecrated’ 
Sancti Andreas mynstre & 6a forsprecon biscop mid him. 
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‘the monastery of St. Andrew, and the aforesaid bishops with him’ 
e& se kyng Heanri ferde ouer se into Normandi on heruest. 
‘And the King Henry went over sea into Normandy in harvest? 


In (30) and (31), all the arguments are preceded by demonstratives. The only ex- 
ception seems to be the quoted proverb. The prepositional objects, such as ouer se 
and on heruest in (29), lack a demonstrative or article. In (30), there are two pos- 
sible instances of pro-drop and two he pronouns and their use as expected. The 
same is true in (31). 


(30) Des ilces geares com se abbot Heanri of Angeli cefter esterne to Burch. 
“This same year came the Abbot Henry of Angeli after Easter to Peterbor- 
ough’ 

e& seide pet he heefde forleten bone mynstre mid ealle. 

‘and (he) said that he had relinquished that monastery withal’ 
ZEfter him com se abbot of Clunni Petrus gehaten to Englelande 
‘After him came the Abbot of Clugny, Peter by name, to England’ 
bi bes kynges leue & wees underfangen ouer eall swa hwar swa 

‘by the king’s leave; and was received by all, whithersoever 

he com mid mycel wurdscipe. 

‘he came, with much respect? 


(31) To Burch he com & ber behet se abbot Heanri him peet he 
“To Peterborough he came; and there the Abbot Henry promised him that 
he 
scolde beieton him pone mynstre of Burch beet hit scolde beon 
‘would procure him the ministry of Peterborough, that it might be’ 
underéed into Clunni. Oc man seid to biworde. hege sitted 
‘subject to Clugny. But it is said in the proverb, The hedge abideth’ 
ba aceres deleth. God elmihtig adylege iuele rede. 
‘that acres divideth. May God Almighty frustrate evil designs: 
e& sone beereefter ferde se abbot of Clunni ham to his erde. 
‘Soon after this went the Abbot of Clugny home to his country’ 


The real change comes in the final part: demonstratives are used pragmatically in 
the early parts but grammatically, marking subjects and objects, in the later parts. 
Some examples from this entry are given in (32) and (33), which are from the start 
of the entry for the year 1137. Note that names such as be king Stephne and Henri 
king show that the article is in complementary distribution with the preposed 
name, i.e. the article and name are both in D. The articles occur in subject and 
object position; the pronouns are used as in Modern English. 
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(32) dis gcere for pe king Stephne ofer sce to Normandi & ther wes underfangen 
“This year, (the) King Stephen crossed the sea to go to Normandy and was 
received 
forpi dct hi uuenden det he sculde ben alsuic alse the eom wes. 

‘there because they thought he was like the uncle (i.e. his uncle)’ 

e& for he hadde get his tresor. ac he todeld it & scatered sotlice. 

‘And because he still had his treasury, but he divided and scattered it 
stupidly’ 

Micel hadde Henri king gadered gold & syluer. & na god ne dide me for his 
saule thar of. 

‘King Henry has gathered much gold and silver and no good did men with 
it for his soul? 


(33) Da pe king Stephne to Englalande com ba macod he his gadering cet 
Oxeneford. 
‘When King Stephen came to England, he held a gathering at Oxford’ 
e& bar he nam pe biscop Roger of Sereberi & Alexander biscop of Lincol 
‘and there he took bishop Roger of Salisbury and Alexander bishop of 
Lincolr’ 
e& te Canceler Roger hise neues. & dide elle in prisun. til hi iafen up here 
castles. 
‘and the chancellor Roger, his nephews. And put all in prison until they 
gave up their castles’ 


The three stages analyzed in 2.2.2 show an initial increase in demonstratives and 
then the reanalysis of demonstratives as articles. The difference between the use of 
demonstratives in (26) and (27) on the one hand and in (32) and (33) is very no- 
ticeable. Personal pronouns are sometimes used as topic shifters and, from (19) 
and (20), we know that a special feminine pronoun she was being introduced in 
the last part. 


2.3. Middle and Modern English 


We have seen that the masculine demonstrative se is reanalyzed as the definite 
article, e.g. in (17). The neuter that becomes the distal demonstrative (initially in 
complementary distribution with the article°), as in (34) and (35). 


(34) 7 0 patt allterr haffdenn pe33 Glowennde gledess 3arkedd. 
‘and on that altar had they prepared glowing coals? (Ormulum 1066-9) 


5. In Ormulum and the Peterborough Chronicle, the article is used when assigned structural 
Case and the demonstrative in oblique position, e.g. it appears after a preposition. 
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(35) pe Laferrd haffde litell rum Inn all patt miccle riche. 
“The lord had little room in all that great kingdom? (Ormulum 8489-90) 


The independent use of the demonstrative, as in (36), from the early 14th century 
continues. 


(36) Pryue synne and sacrylage, That loue y moste. 
‘Secret sin and sacrilege. That, I love the most. 
(Handlyng Synne 12560, from the OED) 


Middle and Modern English use personal pronouns, even when the topic is not 
clear, as in (37), or the former/latter, but not a demonstrative as in (38) if it refers 
back to a noun. 


(37) Obama met Putin at the Kremlin. He wanted to discuss Afghanistan. 
(38) Obama met Putin at the Kremlin. *That (one) wanted to discuss Afghani- 
stan. 


As mentioned, since the early Middle English period, the former and the latter have 
been used in formal writing/speech to disambiguate the antecedent, as in (39). 


(39) Wisedome and Fortune combatting together, If that the former dare but 
what it can, No chance may shake it. 
(Shakespeare, Antony & Cleopatra III. xiii. 80, from the OED) 


Modern English has a limited use of independent demonstratives. 

In conclusion to Section 2, I have shown that changes occur in demonstra- 
tives, the main one being the increase in use of the demonstratives and their re- 
analysis as articles. As a result of this and external changes in the pronoun system, 
pronoun ended up doing some of the duties that demonstratives used to do. 


3. Pronominal variation: Dutch and Scandinavian 


In this section, I provide a few comparisons between English, Dutch, and the 
Scandinavian languages. The latter is partly motivated by a possible influence of 
Old Norse on Old English, e.g. in the Ormulum and the Peterborough Chronicle. 
This is obviously a preliminary look. Contact may have accelerated the grammati- 
calization of the demonstratives but the actual realignment of the functions of the 
demonstratives and personal pronouns is unique to English. 

There has been sporadic work on the difference between pronouns and demon- 
stratives in the other Germanic languages (Kirsner 1979, Comrie 2000, van Gelderen 
1998, and Petrova & Solf 2010). For instance, Comrie (2000) argues that, in Dutch, 
demonstrative pronouns are used to refer to non-topical antecedents, very similar to 


212 Elly van Gelderen 


the situation in Old English discussed above, whereas personal (and possessive) pro- 
nouns refer to topical ones. Dutch reference grammars (e.g. Geerts et al 1984: 215;219) 
make similar observations about the demonstrative introducing new information 
and being used for emphasis. To show some of the complexities of this system, take 
a look at (40), which has Brandaan as the topic referred to with the first hij ‘he: The 
two demonstratives deze ‘this one’ shift the topic to the giant, but the second deze 
could be replaced by hij. The instances of hij cannot be replaced by deze. 


(40) Op de dag van de afvaart, vond hij het hoofd van een dode man op het 

strand. Het was een zeer groot hoofd en Brandaan vroeg aan het hoofd 
wat hem was overkomen. Deze antwoordde dat hij vroeger ver de zee in 
liep (hij was wel honderd voet lang) om schepen te beroven, maar op een 
dag kwam er een vloedgolf, waardoor hij gedood werd. Brandaan vroeg 
nu, of hij God moest bidden om de reus weer te laten leven, als deze 
Christen wilde worden. De reus sloeg dit aanbod af, zeggende dat hij dan 
weer moest sterven en dat hij nu minder gestraft werd door de duivel, 
omdat hij voordien nooit van Christus gehoord had en dus ook niet kon 
weten hoe hij had moeten handelen. 
‘On the day of the trip, he found the head of a dead man on the beach. It 
was a very large head and Brandaan asked the head what had happened to 
him. This/he answered that he used to go out very far into the sea (he was 
100 feet tall) to rob ships but that one day a wave had come where he had 
been killed. Brandaan asked if he (=Brandaan) should pray to God to let 
the giant live again if this/he (=the giant) would become a Christian. The 
giant turned the offer down saying that he would have to die again and that 
he would be punished less now by the devil because he (=the giant) had 
never before heard of Christ yet and was ignorant to know how he should 
have behaved:(from home.planet.nl/~werff632/boeken/brandaan.htm) 


In Old Norse, there are a number of anaphoric strategies, e.g. both the demonstra- 
tive sa and pronoun hann are used in (41) from the Poetic Edda. This seems to 
follow the pattern in Old English and Dutch. 


(41) Ominnis hegri heitir sa er yfir alprom prvmir, 
forgetting heron called he who over beer _quiet-lies; 
hann stelr gedi gvma 
he — steals mind man 
‘Forgetting is the name men give to the heron/bird; he steals the minds of 
man over beer? (Havamal 13; http://etext.old.no/Bugge/havamal.html) 


Faarlund (2004: 35-6) provides a list of the demonstrative and personal pronouns 
in Old Norse, as in Tables 4 and 5. 
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Table 4. Demonstratives in Old Norse 


Singular Masculine Feminine Neuter Plural Masc Fem Neu 
Nom sa su pat peir per pau 
Gen __ bess peirar bess peira peira peira 
Dat peim peiri pvi peim peim peim 
Acc pann pa pat pa peer pau 


Table 5. ON personal pronouns 


Singular Dual Plural 
First NOM ek vit ver 

GEN min okkar var 

DAT mer okkr oss 

ACC mik okkr oss 
Second NOM pu it er 

GEN pin ykkar yoar 

DAT per ykkr yor 

ACC pik ykkr yor 

reflexive 

Third NOM hann/hon - - - 
(M/F) GEN hans/hennar - = sin 

DAT honum/henni - ~ ser 

ACC hann/hana - = sik 


There are differences and similarities between Old English and Old Norse in the 
morphology and in the organization of the paradigm. A notable difference is that 
third person pronouns are not used reflexively but that a special reflexive form is 
used (sik in the accusative). Like Old English, however, the paucity of forms is 
obvious in the third person paradigm: there are no neuter and plural third person 
pronouns because the demonstrative is used, as shown in (42) for peir ‘they’ 
(I have also bolded the other pronouns). 


(42) Osnotr mapr hyger ser alla vera vidhloiendr vini hitki hann fipr, 
fool man thinks his all be laugh friend that-one he finds 
bot beir vm hann far lesi, ef hann mep snotrom sitr. 
although they around him evil contrive when he with smart sits 
“The fool thinks that those who laugh at him are all his friends although 
when he sits with wiser men he hears they speak ill of him’ 
(Havamal 24; http://etext.old.no/Bugge/havamal.html) 
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The situation in the Modern Scandinavian languages is quite different. Wessén 
(1968) writes that third person pronouns and demonstratives are close in meaning 
(“kommer till betydelsen varandra nara” 1968:51) and that mostly both can be 
used but that demonstratives give more clarity (“ger store tydlighet” 1968: 64) 
whereas the pronoun continues the subject of a preceding clause, as in e.g. (43), 
from Norwegian. 


(43) .. tenkte Helge og smilte -- for der han stod med frakkekraven brettet op 
for kulden, hadde han folt en underlig historisk stemning stryke gjennem 
sig. Men saa tok han til at nynne med ... og fortsatte nedover gaten i den 
retning, som han visste, Corsoen skulde ligge.. 

‘thought Helge and smiled — because there he stood with his coat-collar 
turned up against the cold, a special historical atmosphere had come to 
him. But then he started to hum along ... and continued in the direction 
where he knew the Corso was: (Sigrid Undset Jenny, p. 1) 


Faarlund et al (1997: 210-212; 317) explain that the demonstratives denne and den 
are common in bokmal and nynorsk, two major varieties of modern Norwegian, 
and refer to persons or things. In most pedagogical and descriptive grammars of 
Swedish and Norwegian, the list of pronouns is given as in Table 6 for Swedish; the 
den, det, de, and dem forms also function as demonstratives next to more em- 
phatic ones such as den har ‘this here’ 

Searching some Swedish and Norwegian sites, the den is not used to refer to 
persons, however. Den, in e.g. (44), always refers to a book or a video. Reading 
modern Scandinavian as a Dutch native speaker, the personal pronoun definitely 
predominates, as (43) shows. 


(44) Den har jag inte ... Swedish 
“That have I not .. 


Different varieties have slightly different systems. For instance, nynorsk uses han 
‘he and ho ‘she also to refer to non-human antecedents. 


Table 6. Third person pronouns in Swedish 


subject object 
3MS han honom 
3FS hon henne 
3S (common) den det 
3S (neuter) det det 


3P de dem 
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4. Conclusion 


In Old English, we have a system of pronominal reference that differs significantly 
from that of Modern English. Both personal pronouns and demonstratives can be 
used anaphorically but with different functions in terms of information structure. 
One of the demonstratives, the masculine singular se, is reanalyzed as article; an- 
other, the neuter singular that, is reanalyzed as the regular demonstrative. Replacing 
the independent demonstrative are the externally renewed personal pronouns. Once 
the appears, it cannot be used independently and certainly not as topic shifter. 

Pronouns vary enormously between languages. The features can be interpre- 
table or uninterpretable,the pronoun can occupy a specifier or head position, and 
have person as well as deixis features. Old Norse and Modern Dutch are similar to 
Old English but in Modern Scandinavian the demonstrative is used less to inde- 
pendently refer to persons. 


Abbreviations 

F feminine 

i- interpretable 

M masculine 

OED Oxford English Dictionary (online version used) 
P plural 

PC Peterborough Chronicle 
ps person 

S singular 

u- uninterpretable 

# number 
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Blevins and Garrett (1998) investigate in detail the origins of CV/VC 

metathesis in a number of languages and identify two types of metathesis and a 
“pseudometathesis”. For them, “pseudometathesis” is a synchronic process that 
does not originate through the historical process of metathesis. They analyze 
languages in which “pseudometathesis” originates through epenthesis and 
deletion (1998) or through reinterpretation and generalization of other processes 
in the language (Garrett and Blevins 2009). I argue here that metathesis in Batsbi 
originates as a result of grammaticalization, together with regular phonological 
processes. When a function word, such as an auxiliary, grammaticalizes as an 
affix on a base, affixes trapped between the base and new affix are often lost 
(Harris and Faarlund 2006). However, in Batsbi some trapped affixes were not 
immediately lost, and I argue that this is the source of the variable position of the 
present tense marker, and that its variable position was reanalyzed as metathesis. 
I argue further that the reanalyzed process is true metathesis synchronically. 


Metathesis is a process by which two segments change order, with X(Y)Z becom- 
ing Z(Y)X. While metathesis was long considered an irregular process occurring 
only infrequently in languages, recent work such as Blevins and Garrett 1998 and 
Hume 2001 has shown that the process is not as rare as had been believed and that 
it is often entirely regular. 

Blevins and Garrett (1998, 2004) identify four general types of metathesis and 
three sources of what they term “pseudometathesis’, that is, metathesis alterna- 
tions other than ones originating in sound change. 
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(1) Sources of metathesis alternations other than sound change 
a. Loan adaptation 
b. Telescoping (epenthesis + deletion) 
c. Analogical morphophonology (Blevins & Garrett 2004: 142) 


In Batsbi we find metathesis of the kind illustrated in (2), where -d precedes the 
present tense marker, -o, in the first and second persons, but follows it in the 
third. 


(2) lev-d-o-s ‘Tsay’ 
lev-d-o-(h) ‘you say’ 
lev-o-d ‘s/he says’ 


I argue here that metathesis in Batsbi originated in a manner not included in (1), 
from a syntactic construction involving a main verb and an auxiliary. When the 
auxiliary grammaticalized as a transitive marker, a present tense marker was 
trapped between the main verb, on which it occurred as a suffix, and the gram- 
maticalizing auxiliary. This stage is attested in (3) and many other examples, where 
-o is the trapped tense marker and -” reflects the present tense marker originally 
on the auxiliary, though it has been reduced (from /o/) by regular processes. 


(3) xe-n-mak-re” gor-i ak’-o-d-” ! (Kadagize & Kadagize 1984) 
tree-paT-on-from apple-pL.aBs throw-PRES-CM-PRES 
“They are throwing apples from (on) the tree? 


Examples like that in (3) are not accepted by younger speakers today, who use 
forms like those in (2) instead. I believe that this is the first paper proposing an 
instance of metathesis originating in the grammaticalization of a syntactic pattern, 
and it is hence fortunate that the development is attested. 

In this paper, I describe the structure of verbs in Batsbi ($1) and metathesis 
(§2) in the present tense forms of transitive verbs and consider how this metathesis 
should be classified according to the typology above ($4). I argue ($3) that Batsbi 
metathesis originated through grammaticalization with retention of the trapped -o 
of the present tense, together with subsequent changes. 


1. The following abbreviations are used in this paper: ABs absolutive case, ABSL absolutive 
verb form, ALLII allative II, aor aorist, AUX auxiliary, cm class marker, paT dative, ERG ergative 
case, EX(CL) exclusive, FUT future, GEN genitive, IMPF imperfect, IMPFV imperfective, IN(CL) 
inclusive, INTR intransitive, NEG negation, OBL oblique stem formant, pt plural, pREs present, Q 
question, sG singular, TR transitive, vN verbal noun. The gender-number markers required by 
nouns are given in some examples in parentheses after the meaning of the root, with the singular 
form before the slash, and the plural after. CMs are in bold. ** indicates ungrammaticality. 
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1. Basic verb morphology in Batsbi 


Batsbi (also known at Bats or Tsova-Tush) is a member of the Nakh-Daghestanian 
family, most closely related to Chechen and Ingush. It is spoken in the village of 
Zemo Alvani in eastern Georgia and is severely endangered, with the youngest 
speakers born around 1965. 

Some verbal lexemes require a gender-number prefix, while others do not per- 
mit one. Only verbs that begin with a vowel or /%/ may have a gender-number pre- 
fix; (4) shows that not all vowel-initial verbs occur with a gender-number prefix. 


(4) d-ahar ‘take; bring; castrate’ ahar ‘steal; grind’ 
d-eblar ‘place; lay; put down in’ _ eblar ‘thread, put’ 
d-ebcar ‘tie’ ebc’ar ‘push; weigh; milk’ 
d-ot:ar ‘pour into’ ot:ar ‘stand, stay’ 
d-ot'ar ‘go; go over’ otar ‘spread’ 


As we can see from the minimal pairs in (4), presence of a gender-number prefix 
is lexically governed. In the Caucasus, these prefixes are traditionally referred to as 
class markers, and here they are glossed as CMs. 

Some transitive verbs are derived from intransitive roots by means of the suffix 
-i, which is obligatorily preceded by a CM. The suffix -i itself surfaces only in the 
aorist (6) and tenses based on it; in other forms it is not realized. (5) presents ex- 
amples with and without a prefixal CM; all examples in (5) contain the CM 
(in bold) that accompanies the transitivizer, which here is in its zero realization. 


(5) d-as-d-ar [cmM-melt-cM-vNn] ‘melt; soak’ 
d-ek-d-ar [cmM-cry-cm-vn] ‘cause to cry, plead’ 
d-ol-d-ar [cm-begin-cm-vn] ‘begin, start’ 
lak-d-ar [throw-cmM-vn] ‘throw, toss 
tes-d-ar [convince-CM-vN] ‘convince; confide 
qol:-d-ar — [prop-cm-vn] ‘prop up, support’ 


Every verb form must have an additional suffix, either a tense marker, such as -o, 
the formant of the present tense for all transitives and some intransitives, or a 
marker of a non-finite form, such as the -ar that forms the verbal nouns in (5). 
Some forms have more than one such suffix. For example, the imperfect tense is 
formed on the present stem and thus uses both the present suffix -o and the for- 
mant of the imperfect, -ra; this is discussed further below. Participles and abso- 
lutes are formed from tensed verb stems and are likewise discussed further below. 

Case marking in Batsbi is basically ergative-absolutive. (6) shows that in tran- 
sitives subjects are ergative, and direct objects are absolutive. 
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(6) manana-s oqui” naq bist’-en kor-ui 
Manana-eErc his/her.GEN friend(y/d)-patT hand(d/d)-PL.aBs 
cukba-d-i-e” 
give-CM-TR-AOR 
‘Manana, gave gloves (lit. hands) to her, friend’ 


While subjects of intransitives were absolutive in Proto-Nakh, in modern Batsbi 
first and second person subjects of some verbs are absolutive, while other are erga- 
tive. Third person subjects of intransitives are absolutive (see Holisky 1987 for 
details). 


(7) a. (so) qiul y-at-e"-s” 
LABS poverty CM-fall-Aor-1sG.ABs 
‘T lost weight (by accident)’ 
b. (as) qiul — y-et-n-as 
LERG poverty CM-fall-aor-1SG.ERG 
‘T lost weight (on purpose): 


(Note that the expression ‘lose weight’ is literally ‘poverty fall. The change in the 
root vowel in (7b) is due to the regular syncope of the vowel /e/ from the aorist 
suffix, /-en/.) 

In transitive verbs, CMs indicate agreement with the direct object, which is 
always in the absolutive case (e.g. (6)). In intransitives, CMs indicate agreement 
with the subject, which may be in the absolutive or the ergative (e.g. (7)). 

Verbs with first person singular, first person plural exclusive, or second person 
singular or plural subjects or direct objects agree in person, number, and case with 
those arguments. Third person singular and plural and first person plural inclusive 
do not condition person-number agreement. Table 1 presents the person-number 
agreement markers, and (8-11) illustrate their use. (6) illustrates the lack of per- 
son-number-case agreement with third person arguments. 


Table 1. Ergative and absolutive agreement markers in Batsbi and the independent 
pronouns they derive from 


Agreement suffixes Independent pronouns 
Ergative Absolutive Ergative Absolutive 
1st SG -as -s” as so 
2nd sc -ah -h” ah ho 
Ist EXCL -atx -tx” atx txo 


2nd PL -e8, -ais§ -is” es, ais Su 
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Morpheme-initial /a/ is affected by certain preceding vowels; lip-rounding of 
word-final consonants (written here as C”) is optionally lost, as is second person 
singular h. 


(8) éai-l Kazik moc’ let’-d-o-s. 
tea-ALLtt a.little honey(d/).ABs add-CM-PRES-1SG.ERG 
‘Tadd a little honey to my tea’ 
(9) ah vux teg-d-in-a(h) 
you.ERG what.aBs do-CM-AOR-2SG.ERG 
“What did you do?’ 
(10) atx qit-n-atx 
we.ERG understand-Aor-1PL.ERG 
“We understood? 
(11) psare —y-ax-it-n-es es fe) 
yesterday CM-g0-CAUS-AOR-2PL.ERG yall.ERG him/her.aBs 
‘Yesterday y'all let him/her go’ 


2. Description of metathesis in Batsbi 


In Batsbi, transitive verbs form the present and future tenses with the marker is -o; 
this paper is limited to a consideration of transitives. Under circumstances to be 
identified below, this -o metathesizes with the preceding consonant. The paradigm 
in (12) illustrates this; note the difference between the first and second persons of 
both numbers, on the one hand, and the third person examples, on the other. 


(12) y-o?-y-o-s cM-bring-CM-PRES-1sG.ERG] ‘Twill bring her’ 


[ 
y-o?-y-o-(h) [cm-bring-cM-pRES-2sG.ERG] ‘you will bring her’ 
y-o?-0-y [cM-bring-PRES-CM] ‘s/he will bring her’ 
y-o?-y-o-tx  [cm-bring-CM-pRES-1Ex.ERG] ‘we will bring her’ 
y-o?-y-w-i8 | [cm-bring-cM-pRES-2PL.ERG] ‘Yall will bring her’ 
y-0?-0-y [cM-bring-PRES-CM] ‘they will bring her’ 


The element glossed “CM” is the class marker described above. Future tense, like 
the present, is formed with -o, which I gloss ‘pres’; future differs from present 
(in verbs that distinguish them) in that future is formed on the perfective stem, 
while present is formed on the imperfective stem. In the paradigm in (12), the 
CMs are feminine gender, singular (y-). As one can see from (12) and other para- 
digms, third persons agree in gender-number, but not in person-number, realized 
in the last suffix. (13) provides examples with different genders. 
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(13) a. bader d-o?-o-d 
child(d/d).aBs cm-bring-PRES-cM 
‘S/he will bring the child’ 
b. v-o?-v-0-s 
cM-bring-CM-PRES-1SG.ERG 
‘T will bring him’ 


As one can see in (12), in the first and second persons, the order is CM followed by 
-o, while in the third person it is -o followed by CM. In order to show that -CM-o- is 
the basic order, I show another tense paradigm in (14), built on the present. In the 
imperfect, in (14), the present tense morpheme, -o, precedes the imperfect marker, 
-ra, and both follow the CM in all person-number combinations. In other paradigms, 
too, the tense marker follows the CM, and I conclude that -CM-o- is the basic order. 


(14) let’-d-o-ra-s [add-cM-PRES-IMPF-1SG.ERG]  ‘T used to add it’ 
let’-d-o-ra-(h) | [add-cM-PRES-IMPF-28G.ERG] ‘you used to add it’ 
let’-d-o-r [add-CcM-PRES-IMPF] ‘she used to add it’ 
let’-d-o-ra-tx [add-CcM-PRES-IMPF-1PL.ERG] ‘we used to add it’ 
let’-d-o-r-e8 [add-CM-PRES-IMPF-2PL.ERG] ‘Yall used to add it’ 
[ 


let’-d-o-r add-cM-PRES-IMPE] ‘they used to add it’ 


Metathesis occurs when -o would otherwise be word-final, as it would in the third 
person forms in (12), since third persons do not condition person-number agree- 
ment. It is probably not irrelevant that word-final vowels in polysyllabic words are 
reduced (Holisky and Gagua 1994). In general, word-final -u and -o are realized as 
labialization of the preceding consonant, but they may alternatively be lost com- 
pletely; the unround vowels are always lost in the speech of younger speakers. This 
vowel reduction is the source of lip-rounding in the agreement suffixes shown in 
Table 1, which derive from the pronouns so T, ho ‘you, txo ‘we (ExcL); and su ‘y'all’ 
Third person forms in (14) and other paradigms show loss of word-final a in ra. 

Metathesis is not dependent on the person and number of the form but on 
whether -o would be word-final. This is shown by the fact that the first person in- 
clusive, which does not condition person-number agreement, also shows meta- 
thesis, as in (15). With the inclusive, one must use an independent pronoun or an 
enclitic pronoun. 


(15) ve y-0?-0-y 
we (INCL) CM-bring-PRES-CM 
‘we (INCL) bring her’ 


The question form helps to establish both the underlying order and the fact that 
metathesis is dependent upon -o being otherwise final in the phonological word. 
Yes-no questions are formed with = i enclitic. 
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(16) a. let-d-w=i /let’-d-o=i/ [add-cM-PRES-Q] ‘is s/he adding?’ 
b. let’-d-o-r=i /let’-d-o-ra=i/ [add-cM-pREs-IMPF-Q] ‘was she adding?’ 


The forms in (16) show, further, that metathesis in Batsbi does not occur for the 
purpose of avoiding consonant clusters. 

Below are partial paradigms showing metathesis in the third person in other 
verbs. 


(17) lah-d-o-s [pick-cM-PRES-1SG.ERG] ‘T pick (e.g. flowers)’ 
lah-d-o-(h) [pick-CM-PRES-2SG.ERG] ‘you pick (e.g. flowers)’ 
lah-o-d [pick-PRES-CM] ‘s/he picks (e.g. flowers)’ 

(18) lev-d-o-s [say-CM-PRES-1SG.ERG] ‘I say (e.g. word)’ 
lev-d-o-(f) [say-CM-PRES-2SG.ERG] ‘you say (e.g. word)’ 
lev-o-d [say-PRES-CM] ‘s/he says (e.g. word)’ 


So far, all examples have shown a CM following the root, but only derived transi- 
tives have this configuration. In basic (underived) transitives, the tense marker 
immediately follows the root. No metathesis occurs if the -o immediately follows a 
root, as illustrated in (19) and in the partial paradigms in (20-21). 


(19) meq y-aq-™ 
bread.ABS CM-eat-PRES 


“He/she/they eat bread? 

(20) ebc’-o-s [weigh-PRES-1SG.ERG] ‘T weigh (TR)’ 
ebc’-o-(h) [weigh-PRES-2SG.ERG] ‘you weigh (TR)’ 
ebc-” [weigh-PREs] ‘s/he weighs (TR)’ 

(21) y-al-o-s, v-al-o-s [cm-bring.IMPFV-PRES-1SG.ERG] 
y-al-o-(h), v-al-o-(h) [cm-bring.IMPFV-PRES-2SG.ERG] 
y-al-*, v-al- [cM-bring.IMPFV-PRES] 


‘T bring her, him’ 
‘you bring her, hin’ 
‘s/he brings her, hin’ 


In (19-21) the o is reduced to labialization in the third person forms. Xerc- ‘change’ 
is a transitive verb that may either have or lack the post-radical CM, with a subtle 
difference of meaning as illustrated in (22). 


(22) a. kud xerc-o-b /xerc-b-o/ 
hat(b/d).aBs change-PRES-CM 
‘S/he changes her/his hat’ 
b. kud xerc- 


change-PRES 
‘S/he changes her/his hat’ 
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When a CM is present, as in (22a), there is metathesis; but when no CM is present, 
as in (22b), there is no metathesis, and the word-final -o is reduced to lip-rounding 
(or lost altogether). 

Since CMs, which are always consonants, metathesize, and other consonants 
do not, as shown in (19), (20-21), and (22b), it is natural to ask what happens when 
the final consonant of the root is one of the consonants that also occurs as a CM - 
namely v, y, b, or d. Using an electronic database of Batsbi verbs, I searched for verb 
roots ending in these consonants. I found one transitive root that apparently ends 
in -v, three in -y, one in -b, and four in -d. However, some of these lexemes seem to 
end in a frozen CM, which was not part of the inherited root. For example, in the 
verb toldar ‘surpass, be better than; the root appears to be told-; but Ingush has a 
word tuolar [twolar] ‘surpass, be better, be superior (Nichols 2004: 339a), while 
Chechen has toellash [tycellas] ‘better, superior’ (Nichols & Vagabov 2004: 404b). 
In my view, then, dis nota part of the root of Batsbi toldar, and several other verbal 
lexemes may have the same problem. Kadagize & Kadagize (1984) gives the verb 
get’yar ‘open, written as though the root were qet’y-, but in fact this verb has the 
form get’b- when used with the object bak ‘mouth, showing that the final conso- 
nant is not part of the root but a CM. The verb qobzyar ‘kindle’ appears to have a 
root ending in /y/, but in fact it can have only the object ce ‘fire} which requires the 
gender marker -y in both singular and plural. The words for ‘candle’ and ‘oven, for 
example, cannot be objects of this verb. I conclude that /y/ is not part of the root of 
this verb. The only transitive lexeme I have been able to identify with a root-final v, 
y, b, or dis abar ‘sew’? (23) lists the third person singular subject present tense form 
of this verb, together with a few intransitive lexemes with one of these consonants 
in root-final position and forming the present with -o, as transitives do. 


(23) a. /tiv-o/ tiv” ‘s/he rests’ 
b. No examples with y at the end of the root. 
c. /ab-o/ ab” ‘s/he sews’ 
d. /a"hd-o/ athdo ‘s/he moans, grieves’ 


‘Thus, it appears that, to the extent that verb roots may end in y, y, b, d, these con- 
sonants do not participate in metathesis. 
We can state the conditions on metathesis as in (24). 


(24) Metathesis with o occurs in transitive verbs if 
- owould otherwise be final in the phonological word 
- ois preceded by a CM. 


2. It is likely that inherited roots that originally ended in one of these consonants were either 
reanalyzed as containing a CM or were otherwise changed to avoid confusion with a CM. In 
roots of the form (C) VC, root-final y, y, b, dare less likely to be confused with a CM than in roots 
of the form (C)VCC or (C) VCCC. 
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We can state the metathesis as in (25), clearly a morphologically conditioned rule. 
(25) C+o0>0C/+__# 


It has often been noted (see for example Hume 2001) that stating metathesis for- 
mally is always a problem, in that we must put two elements to the left of the arrow. 
In this case, we must actually put three symbols to the left of the arrow. The “+” to 
the left of the arrow is necessary to prevent the application of metathesis in words 
such as so T, ho ‘you; etc. In the environment of the rule, the “+” prevents metath- 
esis from occurring incorrectly in the third person forms in (23). The “#” is neces- 
sary to prevent metathesis from applying in the first and second person forms in 
all of the paradigms above, as well as in the imperfect tense, illustrated in (14). 

I conclude that the phenomenon described above is synchronically genuine 


CV metathesis, albeit morphologically conditioned, as stated in (25). 


3. On the origins of metathesis in transitive verbs 


In this section, I sketch a hypothesis of the origin of metathesis in Batsbi. In Proto- 
Nakh, verb roots were always followed by at least one tense marker (such as *-o for 
present of transitive verbs) or by a formant of a non-finite form (such as an infini- 
tive in *-an or a verbal noun in *-ar). A tense marker such as *-o could be followed 
by the formant of a participle or absolute, described further below. At this stage, 
verbs potentially had the parts relevant here, though additional suffixes may have 
been possible. Occurrence of the CM was lexically determined. 

In Proto-Nakh and in pre-Batsbi (that is, both before and after the separation 
from Chechen-Ingush), several auxiliaries or light verbs grammaticalized to the 
main verb. (Henceforth I assume that these were auxiliaries, not light verbs.) Since 
these were verbs in their own right, they too had the parts listed above. Thus, some 
of the grammaticalizing auxiliaries brought with them their own prefixes, and all 
brought suffixes. Thus, in Proto-Nakh and in pre-Batsbi, we would have found the 
patterns listed in (26), among others. 


(26) a. (CM)-ROOT-TENSE-({PARTICIPLE, ABSOLUTE}) 
b. (CM)-ROOT-TENSE-({PARTICIPLE, ABSOLUTE}) (CM)-AUX-TENSE- 
({PARTICIPLE, ABSOLUTE}) 


Often, when auxiliaries grammaticalize, trapping affixes between them and the 
root of the main verb, this trapped morphology is lost (Harris and Faarlund 2006). 
We cannot predict when it is lost, and when it remains. In pre-Batsbi the trapped 


3. Note that (25) specifies that the input is + C + o #. 


230 Alice C. Harris 


CMs were not lost (see Harris 2008 for more detail), but trapped tense markers 
were generally lost. 

One of the auxiliaries that grammaticalized was the verb dar ‘make, do. Here, 
d- is the default gender-number marker (representing the full range of gender- 
number markers), and -ar is the formant of the verbal noun. The root itself is -i, 
but in most word-forms it is not realized. This verb is still used as an independent 
verb, as in (27). 


(27) éxindur d-o 
sock(d/d).ABS CM-PRES 
‘[she] makes (knits) socks’ 


(28) provides an example of this verb in its grammaticalized form, where it be- 
came a transitivizer (see also (12-18) above). 


(28) v-o?-v-i-e" 
CM-come.PERF-CM-TR-AOR 


‘She/he/they brought him’ 


Schema 1 shows what happened in the course of grammaticalization in the aorist 
and most other tenses. The double vertical lines indicate a word boundary. 


Afl- Core -Af2 Af3- Periphery | -Af4 
v- o? -e" v- -i/o -e" 
CM root AOR CM AUX AOR 
x 
Afl- Core -Af2 -Af3 Periphery | -Af4 
v- o? -Vv -l -en 
CM root [cM TRANS __| AOR 


Schema 1. Fusion of d- 0?-ar ‘come with d-ar ‘make, do’ in aorist 


The main verb here is the verb with the root 0? ‘come’ With it is the gender-num- 
ber prefix, v- (masculine singular) and the aorist tense suffix -e”; the form of this is 
discussed further below. The light verb ‘make, do’ became the transitivizer, and in 
(28) and Schema 1, its root, -i, is realized. Its gender-number prefix, v-, and aorist 
tense marker, -e", accompany it. The aorist suffix of the main verb is lost in the 
course of grammaticalization, and we do not see it in (28) above. 

While Schema 1 shows what occurred in most tenses in Batsbi, the present 
tense in -o behaved a bit differently in grammaticalization. Schema 2 shows what 
happened in grammaticalization of the present. 
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Afl- Core -Af2 Af3- Periphery | -Af4 
v- o? -0 v- -i/o -0 
CM root PRES CM AUX PRES 
x 
Afl- Core -Af2 -Af3 Periphery | -Af4 
CM root PRES CM TRANS PRES 


Schema 2. Fusion of d- 0?-ar ‘steal’ with d-ar ‘make, do’ in o-present 


In the o-present tense, unlike other tenses, the main verb’s tense marker, -o, was 
not lost. The output of this change is a form like that in (29). 


(29) *v-o?-o-v-” 
CM-come-PRES-CM- PRES 
‘She/he/they bring him? 


While (29) is reconstructed, similar forms from other verbs are preserved in 
Kadagize and Kadagize (1984), as illustrated in (3) and (33-36) below. It is from 
the forms preserved in their dictionary that we know that the present tense be- 
haved somewhat differently in grammaticalization. 

While some languages use tensed main verbs in constructions with auxiliaries 
(for examples, see Anderson 2006, Chapter 4 et passim), as hypothesized in 
Schemas 1 and 2, others use non-finite forms. Although we do not know what the 
Proto-Nakh of this period used, it is likely that it used a participle or absolute verb 
form. The form of the aorist participles and absolutes is not relevant here, since 
they were lost during grammaticalization, as shown in Schema 1. The present and 
future participles in pre-Batsbi were formed from the present and future stems 
with the addition of *-ni, realized today as -n” or -n (Holisky and Gagua 1994). 
Batsbi also possesses an absolute form of the verb, formed by adding -s to the pres- 
ent or future stem. Examples are provided in (30). 


(30) Present/future tense Present Present 
/future participle _ /future absolute 
do ‘makes, does’ duyn ‘making’ dos ‘while making’ 
tegod ‘builds’ tegduyn ‘building’ tegdo’ ‘while building.pres’ 
tagod ‘willbuild’ tagduyn ‘to be built’ tagdos ‘while building.FuT’ 
lelod ‘holds’ lelduyn ‘holding’ leldos ‘while holding’ 


dik’ ‘takes (ANIM)’ dikuyn ‘taking’ dikos ‘while taking’ 
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In the participles in (30) above, uy is the regular reflex of o when iis reduced in the 
following syllable (Mikelaze 1977). (This itself may be an example of Blevins and 
Garrett's prosodically induced VCV metathesis.) 

Both participles and absolutes are used in construction with auxiliaries in 
Batsbi today, as illustrated in (31) and (32), respectively. 


(31) so © skol-i v-ot’-uy-n co. v-a-s. 
I-aBs school-in CM-go-PRES-PTCPL NEG CM-be-1SG.ERG 
‘I (MALE) am not going to school’ 


(32) so bader’ d-ik-o-§ y-a-s™... 
I.aBs child.aBs CM-take-PRES-ABSL CM-be-1SG.ABS 
‘Iam taking the child...’ 


(In (31-32), the case of ‘T is governed by the auxiliary, ‘be not by the main verb.) 
It is likely that the main verb in Schema 2 would have been either in the present 
absolute or the present participle form. In the course of grammaticalization, all 
tenses and formants of non-finite forms were lost except the -o of the present. 
‘The gloss “present” in Schema 2 and below, can be understood as remaining from 
the present tense alone, or as remaining after the loss of the following formant of 
the participle or absolute. 

There is evidence that the present tense marker -o was retained as indicated in 
(29). Kadagize and Kadagize (1984) is a Batsbi-Georgian dictionary with example 
sentences for nearly every entry. It was written by a father-son team, both academ- 
ics, born in 1861 and 1895, respectively. The work was published in 1984, eight 
years after the son's death. It is not clear exactly when the examples date from, but 
it is likely that they reflect the most conservative speech possible, at least that of a 
generation that was already elderly in the first half of the twentieth century. As il- 
lustrated in (33-36), examples in this dictionary reflect the presence of two in- 
stances of present tense -o, both the trapped morpheme and that at the periphery 
of the word. 


(33) qar-o-v alza" —hal=y-aq-o-y-” (Dict 877b) 
rain(y/y)-OBL-ERG river.ABS Uup=CM-raise-PRES-CM-PRES 
‘Rain makes the river rise’ 


(34) Sota-s burt labc’-o-b-” (Dict 354b) 
Shota-ERG ball.ass play-PRES-CM-PRES 
‘Shota is playing ball’ 

(35) ma d-ic’-d-aq-o-d-” is bader.... (Dict 314a) 


NEG CM-forget-CcM-raise-PRES-CM-PRES this child(d/d).aBs 
‘Don't make this child supercilious!’ 
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At least one example even preserves the full final -o0, showing that reduction of fi- 
nal vowels was still not obligatory. 


(36) cinus y-Opxsopx-0-y-o (Dict 495b) 
bride(y/d).aBs cm-dress.specially-PRES-CM-PRES 
“They are dressing the bride specially’ 


For my main Batsbi informant, not born until 1953, the verb forms in (33-36) are 
impossible; instead she gives each without the final lip-rounding, as in (37), reject- 
ing the forms in (33-36): 


(37) a. y-aq-o-y (**y-aq-o-y-”) 
b. labc’-o-b (**labc’-o-b-) 
c. d-ic-d-aq-o-d_ = (**d-ic’-d-aq-o-d-") 
d. y-opxsopx-o-y (**y-opxsopx-o-y-™) 


As observed above, reduction of word-final vowels is a productive process. Al- 
though the dictionary generally gives complete forms with reduced vowels in 
word-final position, today it is always possible to omit word-final vowels instead of 
reducing them. The forms in (37) are produced as summarized in (38). 


(38) Pre-Batsbi: *y-aq-o-y-o ‘raise (cf. (33))4 
Early Batsbi, with vowel reduction: y-aq-o-y-" 
Early Batsbi, with optional vowel loss: y-aq-o-y 


I suggest that metathesis in Batsbi originated through reanalysis of forms like those 
in (38), as indicated in (39). 


(39) Early Batsbi, with vowel reduction: y-aq-o-y-™ 
CM-ROOT-PRES-CM-PRES 
Loss of word-final labialization CM-ROOT-PRES-CM 
Reanalysis CM-ROOT-PRES-CM 


The process indicated in (39) is not reanalysis in which the function of a mor- 
pheme is interpreted as something other than its previous function; here, the func- 
tion of -o has not changed.° This is reanalysis in the sense that speakers must have 
begun at this point to consider the final stage in (38/39) to be the result of metath- 
esis; they considered the structure of the form to be different, even though the 
linear order did not change. That is, elsewhere in the language, CMs precede tense 


4. This form is reconstructed for this verb but attested in (36) for the verb there. 

5.  “Reanalysis ... changes the underlying structure ...and ... does not involve any modification 
of ... surface manifestation” (Harris & Campbell 1995). See also Langacker (1977: 58) and 
Timberlake (1977). 
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markers and do not follow them. Since the CM in the final stage of (38/39) follows 
the tense marker, speakers, unaware that there were earlier two tense markers, 
must have looked upon this order as the result of metathesis. 

Marking of first and second person by suffixes seems to have originated early 
in the twentieth century, and it was at first optional. Note that the trapped -o would 
not have occurred in surface forms of the first and second person, because syn- 
cope applies regularly in these forms (Mikelaze 1977), as illustrated in the para- 
digm of the present in (40). ((40) is (12) with the addition of historical forms.) 


(40) Attested Historical Gloss Translation 
y-o?-y-o-s_/ y-0?-o-y-o-as/  [cm-bring-cM-pRES-1sG.ERG] T will bring 
her’ 
y-o?-y-o-(h) / y-02-0-y-o-ah/ [cm-bring-cM-PRES-2sG.ERG] ‘you will 
bring her’ 
y-o?-o-y _/ y-o?-o-y-o/ —_ [cm-bring-prEs-cm] ‘s/he will 
bring her’ 
y-o?-y-o-tx / y-0?-0-y-o-atx/ [cmM-bring-cM-PRES-1PL.ERG] ‘we will 
bring her’ 
y-o?-y-w-is / y-0?-0-y-o-ai8/ [cmM-bring-CM-PRES-2PL.ERG] ‘yall will 
bring her’ 
y-o?-o-y _/y-o?-o-y-o/ —_ [cM-bring-pRES-CM] ‘they will 
bring her’ 


In (40) syncope applies in the first and second persons, singular and plural. (In the 
third person, word-final vowel loss applies.) 
The same process of syncope would have been conditioned in the imperfect, 


(41). 
(41) Attested Historical Gloss Translation 

ebcras /ebc’-o-ra-as/ [weigh-PRES-IMPF-1SG.ERG] ‘I was weighing 
(TR) 

ebcrah /ebc’-o-ra-ah/ [weigh-PRES-IMPF-2SG.ERG] ‘you were 
weighing (TR)’ 

ebcor /ebc’-o-ra/ [weigh-PRES-IMPF] ‘s/he was weighing 
(TR) 

ebcratx /ebc’-o-ra-atx/ [weigh-PRES-IMPF-1EX.ERG] ‘we were weighing 
(TR) 


ebcres /ebc’-o-ra-ai8/ [weigh-PRES-IMPF-2PL.ERG] ‘yall were 
weighing (TR)’ 

ebcor /ebc’-o-ra/ [weigh-PRES-IMPF] ‘they were 
weighing (TR)’ 
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Thus, this analysis explains the discrepancy between the order of the third person 
and that of first and second persons. In first and second persons, a regular process 
of syncope (which applies in other environments as well) eliminates the trapped 
-o, but the second -o surfaces. In the third person the regular process of word-final 
vowel reduction reduces, later eliminates, the second -o in the present tense; there 
is then no vowel to condition syncope. This analysis also explains the fact that v, y, 
b, d metathesize only when they are CMs: historically a trapped -o occurred only 
in lexemes in which the auxiliary dar ‘make, do’ grammaticalized as a transitivizer. 
Once the pattern was reanalyzed as metathesis, it was never generalized to all con- 
sonants (or even to the same consonants when not functioning as CMs), but re- 
tains its morphological restriction. 

As a summary, I give the reconstructed forms in (42), based on the verb of 
(12), v-o?-o-v-” ‘she/he/they will bring him’. These forms are reconstructed as un- 
derlying forms in early Batsbi, well after grammaticalization of the auxiliary (which 
began in Proto-Nakh), after grammaticalization of the pronoun as person-number 
agreement (specific to Batsbi), and before loss of the medial -o morpheme.® 


(42) Today Reconstruction Gloss Translation 

vo?vos *v-o?-0-v-@-o-as [cM-bring-pRES-cM- ‘I will bring him’ 
TR-PRES-1SG.ERG] 

vo?voh *v-o?-o-v-@-o-ah [cm-bring-pres-cm- ‘you will bring him 
TR-PRES-2SG.ERG]| 

vo?ov *v-0?-0-v-@-o [cM-bring-PRES- ‘s/he will bring hin’ 
CM-TR-PRES]| 

vo?votx *v-o?-0-v-@-o-atx [cm-bring-pREs-cm- ‘we will bring hin’ 
TR-PRES-1EX.ERG] 

vo?vwis *v-o?-0-v-@-o-aisu [cm-bring-pREs-cm- ‘yall will bring him’ 
TR PRES-2PL.ERG | 

votov *v-o?-o-v-G-o [cm-bring-PRES- ‘they will bring hin’ 
CM-TR-PRES]| 


Although I have labeled the second column “Reconstruction”, forms of exactly 
these structures are preserved in (33-35) and many other examples in Kadagize & 
Kadagize (1984). The present-day first and second person forms are derived from 
the reconstructed forms by syncope, a very productive, regular process (Mikelaze 
1977); the syncope eliminates the medial -o. (The vowel a is also lost from each of 
the person-number suffixes, and the second person plural suffix causes changes in 
the peripheral -o.) The third person forms undergo word-final vowel reduction, in 
this case complete loss. This is entirely regular, although reduction to lip-rounding 


6. At this stage, the person-number-case markers were probably enclitics rather than suffixes. 
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is a possible alternative. These two changes together create the illusion of metath- 
esis, which is later reanalyzed as true metathesis. 


4. ‘The place of Batsbi o-metathesis in the Blevins & Garrett typology 


If the analysis set out above is correct, it is clear that Batsbi “metathesis” did not 
originate through a diachronic process of metathesis. Rather, it originated through 
the grammaticalization of a syntactic pattern, as indicated in Schema 2, followed 
by the application of productive phonological rules and reinterpretation of third 
person forms, as in (38-39). In this sense, Batsbi is clearly an example of what 
Blevins and Garrett call “pseudometathesis”. As far as I am aware, this is the first 
example of metathesis described where the origin is through grammaticalization, 
followed by phonological processes and reanalysis. 

Language learners hearing the forms of (42) that are labeled “today” would 
have no way of distinguishing this process from other morphologically condi- 
tioned metathesis. Not hearing the reconstructed but omitted 0’s, these native 
speakers would have no way of knowing the history of the present-day forms. 
Thus, while the Batsbi process focused on in this paper did not originate through 
a diachronic process of metathesis, I would argue that from a synchronic point of 
view this is genuine metathesis. 


5. Conclusion 


Batsbi metathesis results naturally from a diachronic process which is well known 
(grammaticalization) but which has not previously been identified with the ori- 
gins of metathesis. While Garrett and Blevins have found morphological origins 
for metathesis in some other languages (Garrett and Blevins 2009), as far as I am 
aware, no grammaticalization origin has previously been proposed. There are two 
reasons metathesis with such origins might be somewhat uncommon. First, in 
some languages the form of a main verb that is used with an auxiliary does not 
have the same or similar tense marking, as it must have had in Proto-Nakh. Nev- 
ertheless, there are many languages where this does occur. Further, there are other 
languages where a simple tense, such as present, forms the stem for formation of 
non-finite forms, as it does in Batsbi. Second, there is a tendency for morphemes 
trapped in the process of grammaticalization to be eliminated, not retained (Harris 
and Faarlund 2006). In spite of these two factors, it seems likely that the same 
pathway would lead to metathesis in other instances, some of which would leave 
no trace. If only present-day Batsbi were attested, we would have no way of 
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knowing that the pattern we see in (12) did not originate through a historical pro- 
cess of metathesis. Since origins of this kind cannot always be identified, we can- 
not reliably distinguish a historical process of metathesis from metathesis that has 
resulted from other processes. 
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Indefinitely definite expressions 


Jerrold M. Sadock 
University of Chicago 


Languages with articles indicating the definiteness or indefiniteness of noun 
phrases sometimes allow certain forms to occur without articles. In several such 
cases that are examined here, the expressions without articles are neither definite 
nor indefinite in and of themselves, but can be interpreted either way in one 
context or another. One case from Old Norse and two cases from English that 
work this way will be discussed. But the direct correlation between the presence 
of an overt article and an indication of (in)definiteness, and the absence of an 
article and the lack of such an indication cannot be maintained. Proper names 
without an article are definite, and count noun plurals and mass noun singulars 
are indefinite without an article. 


My interest here is in the properties of article-less noun phrases that contrast with 
alternatives with articles. As one example of the sort I am interested in, consider 
the use of definite articles in Old Norse as discussed in several places in the writ- 
ings of Jan Terje Faarlund. Here is an example from Faarlund (2004): 


(1) The function of the definite article is to express unique and specific refer- 


ence. During the Old Norse period, the use of the definite article was 
marked. The unmarked form without the article could thus also have 
unique and specific reference. 


In a subsequent article, Faarlund (2009: 632) cites Leiss (2000) to the effect that 
“The postposed definite article signals definiteness in syntactic indefiniteness en- 
vironments”. Faarlund writes: 


(2) This strong tendency can be illustrated by the following examples: 


[(25)] a. sat konungr ok drottning i haszeti (Hkr 1.338.13) 
sat king.NoM and queen.Nom in high-seat 
“The King and the Queen were sitting in the high seat.’ 
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b. peir sja nu skipin fyrir sér (Nj 66.6) 
they see now ships.acc.the before themselves 
“They now see the ships in front of them? 


In [(25a)], konungr ok drottning is an animate subject, and therefore expected to be 
definite with unique reference. A definite article would therefore be redundant. In 
[(25b)], by contrast, the definite article is necessary to avoid an indefinite reading 
of the inanimate object skip ‘ships. The definite article is thus the marked option. 

The comment is somewhat misleading, since one might conclude from it that 
only animate subjects can be definite without an article, which does not seem to be 
the case. Examples of definite, article-less noun phrases can be found that are ani- 
mate non-subjects or inanimate subject, or indeed are inanimate non-subjects. 
There are examples of all of these in Brennu-Njdls Saga, the lengthy saga from 
which Faarlund’s example (25b) is taken. In (3c) we find an indirect object in the 
dative case which bears no article and yet is clearly understood as coreferent with 
the shepherd introduced two sentences earlier. That is to say, it has the “unique and 
specific reference” that Faarlund takes to be what the definite article signals. Simi- 
larly, the subject noun skip “ship” in (4c) occurs without a definite article, but refers 
Hrut’s ship in (4a) and is therefore as definite in reference as the coreferent ship in 
the next sentence, where the definiteness is indicated by the suffixed article. And 
finally, in (5c) there is definiteness carried by an NP without an article despite its 
being inanimate and despite its being the object of a preposition. Here, as in (4), 
the reference to this already unique and specific entity is picked up again in (5d) 
by an NP that includes the definite article. 


(3) a. Skarphédinn sa smalamann Hallgerdar. 
Skarphedin saw.3.sc shepherd.acc.sc Hallgerd.GEN.sG 
‘Skarphedin saw Hallgerd’s shepherd? 


b. Pa  hafdi hann  hodgevid hofud af Sigmundi 
then had.3.s he.nom hew.pstpartT head.acc.sG off s.DAT.SG 
“Then he hewed off Sigmund’s head? 


c. Hann - seldi smalamanni i hendur hofudis ... 
he.NoM gave. 3.sG shepherd.pat.sc in hand.acc.pL head.the.acc.sG 
‘He handed the head over to (the) shepherd ...’ 


(4) a. Hrutr gekk sidan til skips og sigldii haf. 
Hrut.Nom went afterwards to ship.GEN and sailed into sea.acc 
‘After that, Hrut went to (the) ship and sailed out to sea’ 


b. Peim gaf vel byri og toku Borgarfjérd. 


them.DaT gave.3sG well fair.wind and took.3pL Borgarfjord.acc 
“They had a fair wind and reached Borgarfjord’ 
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c. Enn pegar er skip var landfast, 

and when RELpRT ship.NoM.sG was land.fast 

reid Hrutr vestr heim. 

rode Hrut west home 

‘And when (the) ship was made fast, Hrut rode home westward? 
d. enn Ozurr lét rydja_— skip-it. 

and Ozur let unload ship-the.acc.sG 

‘And Ozur had the ship unloaded? 


(5) a. Gunnar pakkadi honum og bjé skip sitt 
Gunnar thanked him and readied shipacc.se his.acc.sG 
snemmendis. 
early 


‘Gunnar thanked him and readied his ship early’ 


b. Hallvardr for ut med beim Kolskeggi. 
Hallvard.Nom.sG travelled out with them Kollskegg.Dat.sG 
‘Hallvard sailed out with Kolskeg and his men: 


Peir komu ut snemma sumars 

they came out early | summer.GEN.sG 
ok tdku Arnarbeelisds 

and took Arnarbaelsos 

‘and reached Arnarbaelsos; 

og var bat fyrir albingi. 

and was that before Allthing.pat.sG 
‘and that was before the Althing’ 


c. Reid Gunnar begar heim fra — skipi 
rode Gunnar then home from ship.pat.sG 
‘Gunnar then rode home from (the) ship’ 


d. en fékk menn til ad rydja__skipit 
and got men in.order to unload ship.the.acc.scG 
‘and got men to unload the ship: 


I think it goes without saying that NPs in Old Norse without a definite article, (and 
therefore with no article, since this language has no indefinite article) are most 
frequently used with indefinite reference, that is without indicating a unique and 
specific referent. Of eleven instances of NPs in Njal’s Saga that consist solely of the 
noun manni the dative singular of the word for “man” with no definite article, 
eight have clearly indefinite reference, and the remaining three have definite refer- 
ence. The abundance of indefinite readings is presumably a pragmatic matter, since 
an explicit indication of definiteness is available and the failure to use it can 
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implicate that it is inappropriate. If that is the case, then unarticulated NPs in Old 
Norse are not, in and of themselves, either definite or indefinite. In other words: 


(6) Old Norse NPs consisting only of a noun without a definite article! are 
neutral as to definiteness. 


English, as well as several other modern European languages, allow certain NPs to 
occur without articles, in particular, common noun plurals and common mass 
singulars. Such non-articulated NPs contrast with corresponding NPs with the 
definite article, as in Old Norse, but unlike Old Norse, the contrast makes the 
English NPs without articles invariably indefinite. Translated into English and 
with the singular replaced by a plural that can occur grammatically without an 
article, the bloody scene in (3) would go like this: 


(7) a. Skarphedin saw Hallgerd’s shepherds. 
b. Then he hewed off Sigmund’s head. 
c. Hehanded the head over to *(the) shepherds ....? 


The little story in (7) is coherent in English with the definite article in (7c) and 
grammatical, but incoherent, without it. Without the definite article, shepherds 
cannot refer to the same individuals that were introduced in (7a). But then what 
would it refer to? And what would the relevance of introducing shepherds in (7a) 
have been? In other words, the English NPs without an article, unlike the Old 
Norse NPs of similar form, are indefinite: 


(8) English plural and mass NPs without an article are indefinite. 


Grammarians who are fond of abstractions might want to assume that English has 
a null indefinite article for use with plurals and mass singulars and I might be in- 
clined to agree with them. But in reality, that is just a way of making the brute fact 
in (8) sound a little less brutish. What is clear is that there is a big difference 


1. ‘The reason for the restriction to a single noun here and in (8) and (9) is that NPs with de- 
monstratives, are definite, while NPs with indefinite quantifiers are indefinite, for example. A 
more general formulation is possible, but it would be complex and would obscure the point that 
Iam making with the restricted rule to this special case. 


2. Irelegate to a footnote the embarrassing fact that utterances of English that lack an article 
can receive definite interpretations. If 1 am out in the garden and yell to my wife, “Tomatoes are 
ripe!” I will probably intend my wife to understand that I am speaking about the very tomatoes 
that we both know are in the garden. But as a native speaker I feel, and other native speakers I 
have consulted agree, that despite phonic evidence, there is a definite article in such utterances, 
and it is not an abstract, inaudible definite article, but the regular, customary definite article the. 
The robustness of this intuition calls into question the frequent assertion that linguistics could 
in prinicple be carried out in the basis of textual data alone. Introspective facts are real, empiri- 
cal facts that cannot be and should not be eliminated. 
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between English and Old Norse bare NPs. Old Norse NPs without articles are not 
grammatically definite or indefinite, as we have seen, but they can be definite or 
indefinite in some sense. So before proceeding, we need to consider what that 
sense of definiteness is. 


Definiteness 


Bertrand Russell (1905) famously analyzed sentence (9) as a logical conjunction of 
the three propositions presented in prose as (10a)-(10c): 


(9) The present king of France is bald. 


(10) a. There is an x who is a king of France. 
b. No one other than x is a king of France. 
c. xis bald. 


The last statement is clearly needed: The meaning of sentence (9) includes a predica- 
tion of baldness of someone or another. If we agree that a proposition that is not true 
is false, then (10a) would seem to be part of the meaning as well, since it cannot be 
the case that the king of France is bald if there is no king of France. But there’s some- 
thing very wrong with the uniqueness requirement imposed by (10b). It says that (9) 
is false if there are two kings of France - even if both are bald. If the definite article 
always implies the unique applicability of a description, then anything you say about 
“the dog’, for example, would be false unless some canine disaster has occurred, leav- 
ing the world with but a single pooch. That is clearly not how we employ the definite 
article in English or in any other language which is described as having one. 

So the English definite article does not in and of itself demand that one and 
only one referent fit a description. But the word the clearly does do something dif- 
ferent from what is done by the indefinite article. There’s a difference between (11) 
and (12), and any reasonably thorough description of English has to account for it: 


(11) The dog is mangy. 
(12) A dog is mangy. 


If the definite article does not signal Russellian uniqueness but does do something, 
what does it do? One common assumption is that it’s something like Birner and 
Ward's (13), cited in Abbott (2005). 


(13)... what is required for felicitous use of the definite article (and most uses 
of other definites) is that the speaker must believe that the hearer is able to 
individuate the referent in question from all others within the discourse 
model. (Birner and Ward 1998: 122) 


244 Jerrold M. Sadock 


If (13) is on the right track, then the role of the indefinite article in languages that 
have one might be something like (14): 


(14)... what is required for felicitous use of the indefinite article (and most uses 
of other indefinites) is that the speaker must believe that the hearer is not 
able to individuate the referent in question from all others within the dis- 
course model. 


I would add that the hearer must actually be able to carry out the individuation in 
(13), and must be unable to carry it out in the case of (14), at least if infelicity is to 
include the full range of conversational mishaps discussed in Austin (1962). If, for 
example, the hearer is able to individuate an entity in the discourse model, a 
speaker who attempts to invoke that entity using an indefinite article will mislead 
the hearer, who will think the referent is something or someone other than the one 
she has individuated. 

Note that the requirements imposed on articles by (13) and (14), are prag- 
matic notions based on felicitous use, whereas Russell’s idea of the meaning of the 
definite was a truth-based, semantic idea. Faarlund’s discussion of the Old Norse 
definite article clearly inclines in some ways toward the pragmatic account of def- 
initeness. In (1) he speaks of the function of the article, rather than its meaning, 
and in (2) he agrees with Leiss’s use of the term signals (signalisiert in the original 
German), which can only be understood as a notion that involves a sender and a 
recipient, a pragmatic notion, that is to say. However, in assenting to Leiss’s (2000) 
formulation that the definite article in Old Norse “signals definiteness in syntactic 
indefiniteness environments” [“signalisiert Definitheit in syntaktischen Indefini- 
theitsumgebungen”], Faarlund subscribes to a less pragmatic view of definiteness 
than the facts warrant. There is nothing, so far as I can see, about the syntactic 
context of smalamanni in (3c), for example, that would suggest an unmarked def- 
inite reading that would make the use of the definite article unnecessary. The al- 
ternative with a definite article is certainly possible in the same context without 
any sense of redundancy. Rather, it is the discourse context (which includes the 
syntactic context) in which a referential term appears that promotes or inhibits 
one reading or another, as Birner and Ward’s definition in (13) suggests. Out of 
context either a definite or an indefinite understanding is possible in (3c), repeated 
here as (15). 


(15) Hann = seldi smalamanni i hendur 
he.NoM gave. 3.sG shepherd.pat.sc. in hand.acc.pL 
hofudio ... 


head.the.Acc.sG 
‘He handed the head over to (the/a) shepherd ...’ 
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Article deprivation 


In a many languages that require articles in some constructions, there are other 
nearly synonymous constructions where no article is possible in certain positions. 
In English, for example, there are two kinds of possessive constructions: preposi- 
tional possessives like (16a), and genitive possessives like (16b). 


(16) a. the/a cousin of the/a teacher 
b. the/a teacher’s cousin 


In both constructions, there are two semantic referents, one a teacher, and the 
other a child of a sibling of a parent of the teacher. In the prepositional possessive 
there are two positions for articles, one for each of the referential expressions. But 
there is only one article position in the genitive construction. We cannot have two 
articles in a row, one for the lower and one for the higher NP: *the a teacher's 
cousin,* and cannot have an article before each noun: *the teacher’s a cousin.* 

These basic facts can be handled quite straightforwardly in a grammatical 
framework in which semantic structures and syntactic structures are separately 
generated (Sadock 1991, 2012). Within such a system, the simple phrase structure 
grammar we see in (17a—d) is sufficient to handle the basic syntactic structures of 
the prepositional and genitive possessive constructions in English. 


(17) a. NP- Art/NP[GEN] +N 
b. NP[GEN] > NP + ‘s° 
c. N+N+PP 
d. PP—+P+NP 


3. In ordinary English structures, two articles cannot come together, but there is one special 
situation where they can, namely when one is in a title like “The Hague” or “The Young and the 
Restless.” Besides the very frequent (1), with 121,000 examples turned up in a Google search, 
there are a few doubly articulated examples of (ii) and (iii) (cf. Sadock 2006). 


i. the Young and the Restless star” 121,000 
ii “the the Young and the Restless star” 5 
iii. “a the Young and the Restless star” 3 


4. Constructions like this occur in other Germanic dialects including Yiddish: 


(i) dem _lerers a kuzine 
the.paT teacher.GEN a cousin 
“a cousin of the teacher” 


5. Lalso take morphological structure to be an independent level of representation. The fact 
that the genitive element in syntax is realized as a suffix attached to the last word in the NP it 
takes as a syntactic complement is handled in the morphological component as discussed in 
Sadock (2012). 
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(17a-d) correctly characterize the syntactic structures of the/a teacher’ cousin and 
the/a cousin of the/a teacher as (18) and (19), respectively. 


(18) NP 
ae 
NP[GEN] N 


a cousin 
NP 
a 


‘s 
Art N 
the/a teacher 


Art N 
the/a teacher 


Without going into detail,® the semantic structures of both kinds of possessive 
constructions involve the two propositional skeleta (20) and (21). 


(20) [(x) (COUSIN (y))] 
(21) [TEACHER (y)] 


For semantic well-formedness, each of the variables x and y must be bound by a 
quantifier. On the commonplace assumption that articles in syntax are quantifiers 
in semantic structure, the prepositional possessive (19) provides two articles and 
therefore two quantifiers, whereas the genitive possessive (18) offers only one 
quantifier. Sadock (2012) justifies a default principle of geometric correspondence 
between representations of the same expression in different modules. Since the 
single article in (18) is associated with the noun teacher, this correspondence prin- 
ciple will make the quantifier that is the meaning of the article bind y, the variable 
associated with the predicate TEACHER. The variable x, the semantic subject of 
the predicate COUSIN (y), will be unbound, making the semantic structure in- 
complete. But (18) is in fact well formed. 


6. A more extensive discussion of quantification and possession is to be found in Sadock 
(2012). 
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Definiteness spreading 


‘The supposition that is often used to reconcile the number of quantifiers in seman- 
tic structure and the number of determiners in syntax in examples like (18) is the 
idea that definiteness spreads from the single quantifier associated with the single 
article in the genitive construction to the other quantifier positions. This idea was 
introduced in connection with the extensive modern literature on a possessive 
construction in Hebrew, traditionally called the construct state nominal, or the 
smixut. 

Like English, Hebrew presents two possessive constructions, one with the pos- 
sessor in a prepositional phrase, and one with the possessor expressed as what is 
frequently called a genitive,’ formally a noun in a special morphological guise 
called the construct state. Though Hebrew has only a definite article, and displays 
head-dependent order, the facts are otherwise superficially similar to those of 
English: the prepositional construction allows each referent to bear an article and 
the construct state construction admits only one referent to be overtly definitized, 
namely the most deeply embedded one, as in English. In the genitive construction, 
the unarticulated references are normally all definite or indefinite depending on 
the definiteness of the most deeply embedded possessor, as the translations of ex- 
amples (22) and (23), adapted from Borer 2008, show. 


(22) a. delet bayit 
door[CS] house 
‘a door of a house 
b. delet ha-bayit 
door[CS] the-house 
‘the door of the house’ 
(23) a. delet beit mora 
house[CS] sons[CS] teacher 
‘a door of a house of a teacher’ 
a. delet beit ha-mora 
house[CS] sons[CS] the-teacher 
‘the door of the house of the teacher’ 


In Hebrew construct state nominals there is a syntactic consequence to the defi- 
niteness of the various nominal parts of the construction in Hebrew. The parts of 


7. This usage does not match what is called the the use of the word genitive in Indo-European. 
In Arabic and Hebrew for example, it is the head noun, not the dependent, that is in the con- 
struct state form, while the dependent occurs in its independent form. Example in the discus- 
sion that follows will make this clear. 
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a construct state nominal can be individually modified by adjectives which follow 
the entire nominal and occur in mirror-image order with respect to the compo- 
nents they modify. Furthermore, Hebrew adjectives agree with the heads that they 
modify in gender and number, and also - importantly — in definiteness. As Borer 
1988 originally pointed out, these facts provide excellent evidence as to the defi- 
niteness of those parts of the nominal that cannot support an article. 

In the following example from Borer (2008), we can tell which noun an 
adjective goes with because of the difference in gender between the masculine beyt 
(the construct-state form of bayit, “house”) and mora “female teacher”. If an adjec- 
tive is masculine, it modifies beyt, and if it is feminine, it modifies mora. 


(24) a. delet beyt ha-mora ha-vatiqa ha-xadas 
door.F[CS] house.M[CS] the-teacher.F the-senior.F the-new.M 
‘the door of the new house of the senior teacher’ 


b. delet beyt ha-mora ha-vatiqa ha-xadaSa 
door.F[CS] house.M[CS] the-teacher.F the-senior.F the-new.F 
‘the new door of the house of the senior teacher’ 


c. delet beyt ha-mora ha-xadasS_ha-levana 
door.F[CS] house.M[CS] the-teacher.F the-new.M the-white-F 
‘the white door of the new house of the teacher’ 


Notice that every referent in examples like these is made definite for agreement 
purposes by the definite article that appears only on the last, most deeply embed- 
ded constituent. Examples with the definite article missing from the adjectives are 
ungrammatical. But a crucial question is what sort of definiteness Borer’s facts 
demonstrate. 

In a recent paper, Danon (2010) endorses a point of view that automodular 
grammar is comfortable with, namely that morphosyntax and semantics are sepa- 
rate things and need not match exactly. He writes (Danon 2010: 148): 


- Hebrew has a morphosyntactic definiteness feature which is part of the syn- 
tactic representation. 

- Although this morphosyntactic feature plays a central role in constraining the 
semantic definiteness value of a noun phrase, the two notions of definiteness 
are distinct. 


The non-syntactic sort of definiteness that Danon is talking about is pragmatic, 
though he does not put it this way. It has to do with felicity of utterances in context 
rather than truth and falsity of isolated sentences. Though Borer’s demonstration 
of definiteness spreading in Hebrew construct state nominals is convincing as far 
as mophosyntax is concerned, Danon provides irrefutable evidence to the effect 
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that the interpretation of construct state nominals allows for indefinite interpreta- 
tions even if the head noun is definite, and that there are cases of understandings 
that are definite though the possessor lacks the definite article.* It seems, then, that 
the unarticulated possessed phrase in a construct state nominal is neutral as to 
definiteness. 

Let us return to English, where there is no morphosyntactic indication of the 
definiteness or indefiniteness of the non-articulated referents in the genitive pos- 
sessive construction. There is no test for syntactic definiteness such as there is in 
Hebrew, so I would like to introduce two tests based on felicity, that is to say, con- 
textual well-formedness, as probes for pragmatic definiteness and indefiniteness 
in English. 


Pragmatic tests for non-definiteness and non-indefiniteness 


(25) A contextually appropriate referring expression that can be appropriately 
replaced with an expression in which it is determined by the definite ar- 
ticle preserving reference, is non-indefinite. 

(26) A contextually appropriate expression that can be appropriately replaced 
with an expression in which it is determined by the indefinite article, pre- 
serving reference, is non-definite. 


The idea is that the definite and indefinite articles impose stringent requirements 
on the contexts in which they can felicitously refer, contexts that are often in com- 
plementary distribution. If I can refer to a particular entity in a particular context 
as the car, then ordinarily I will not be able to refer to it as a car in that contest. 
And if I can refer to it in a context as a car, then I will usually not be able to refer 
to it felicitously as the car. 

For expository ease, let us call a referring expression in which the referent is 
determined by a definite or indefinite article a primary definite or a primary in- 
definite, respectively. The negative definitions provided in (25) and (26) entail that 
no expression can be simultaneously definite and indefinite, because if an expres- 
sion cannot be replaced either by a primary definite or a primary indefinite ex- 
pression, the tests give no result. On the other hand, if an expression meets both 
(25) and (26), then it is both non-definite and non-indefinite, which is to say, it is 
neutral with regard to definiteness. This seems to be the right result in view of the 


8. One of Danon’s examples of this is the following: 
(i) Ota tmunat praxim, 
that/same picture.sm1xuT flowers, 
“that picture of flowers” (Danon 2009: 14) 
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fact that the requirements on the use of definite and indefinite article in (13) and 
(14) are logical contraries. 


Indefinite definiteness of English genitive possessives 


Now let us apply these tests to the English genitive construction in an effort to 
determine the discourse properties of its parts. Recall that in a bi-partite genitive 
structure, only the possessor can bear an article, the common assumption being 
that the possessum shares the discourse properties of the possessor. We can test 
that by asking whether a genitive form that is schematically like (27a) or (28a) can 
only be paraphrased as (27b), or (28b), where the single article is “spread” to both 
referents, or whether it can also be rephrased as (27c) or (28c), where the two ar- 
ticles disagree, without change of reference. 


(27) a. (theN1)’s N2 
b. (the (N2 (of the N1)) 
c. (a(N2 (of the N1)) 


(28) a. (aN1)’s N2 
b. (a (N2 (of aN1)) 
c. (the (N2 (of aN1)) 


To investigate this I searched the internet for phrases like “the/a neighbor’s” “the/a 
person's’, and “the/a woman's”. The contexts very frequently turned out to be news 
stories, a good genre for my purposes, since the writer often cannot assume that 
the reader will have any prior knowledge of the events that are being related, or of 
their participants. 

The short news article has some discourse peculiarities that need to be men- 
tioned. First, the headline precedes the rest of the article, but the information it 
contains is usually not treated as part of the discourse model. In the following 
story two characters are introduced in the headlines, but the story begins by refer- 
ring to them with NPs determined by the indefinite article, as if they were brand 
new entities. 


(29) Cancer-Stricken Granny Tackles Bank Robber 
Robbery suspect claims it was all an April Fool's joke 
By Todd Wright | Saturday, Apr 2, 2011 | Updated 8:45 PM EDT 
An elderly woman with “nothing to lose” stopped a bank robber in her 
tracks Friday, Broward Sheriff’s Office officials said. 
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Helen Dunsford, 66, was a customer at a Bank of America in Oakland 
Park when a woman reached in her purse, said she had a gun and yelled 
for everyone to get on the ground...” 


Secondly, as we see in (29), headlines usually employ an article-free grammar. In 
most cases we have no trouble filling in the articles that would be used if the head- 
line were part of the story itself. Here is how the headlines in (29) would read in 
normal English: 


(30) A cancer-stricken granny tackled a bank robber. The robbery suspect 
claimed it was all an April Fool's joke. 


Note that the article-deprived headline noun phrases cancer stricken granny and 
bank robber would require indefinite articles in ordinary prose, showing that the 
headline NP is non-definite. The NP robbery suspect, however, needs a definite 
article, since its antecedent has just been mentioned, and this shows that headline 
NP’s without articles are non-indefinite. In other words, article-less headline NPs 
are neutral with regard to definiteness, as was the case with article-poor NPs in 
Old Norse. 

There are, of course, many cases where the non-articulated referents in geni- 
tive possessives have the same discourse properties that they would have in a 
prepositional paraphrase with a copy of the possessor’s article. That is especially 
common when a genitive possessive has a definite article, as in (31) and (32). 


(31) Ballroom Dance Positioning 
... The womans right hand rests in the palm of the man’s left hand between 
his thumb and forefinger forming a “Y”.... 


(32) Womans life saved by fake breasts after husband stabs her in the heart 
By Daily Mail Reporter 
A Russian woman can thank her breast implants for a lot more than just 
her shapely figure. 
The Muscovite was the victim of an attempted stabbing by her husband - 
but her life was saved when the knife stuck in her silicone breast. ... 
A plastic surgeon, Sergei Blokhin, claimed the woman's husband asked his 
wife to have a breast enlargement five years ago. 


The dancing lesson could be rewritten as “The right hand of the woman rests in the 
palm of the left hand of the man between his thumb and forefinger forming a “Y”” 
It would be infelicitous, not to mention creepy, to say “A right hand of the woman 


9. _ http://www.nbcmiami.com/news/local/Cancer-119071249.htm]l 
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rests in the palm of a left hand of the man” since ordinary human beings have at 
most one right hand and one left hand. 

The phrase “the woman's husband” in the last line of (31) can be paraphrased, 
though clumsily, as “the husband of the woman” and it cannot be paraphrased as 
“a husband of the woman’, presumably because we live in a monandrous society, 
so any husband of a definite woman will be a uniquely individuable individual. 

It turns out to be remarkably difficult, however, to find examples of the form 
(28a) that can only be paraphrased as (28b). One possible example is the following: 


(33) Toy Maker Recalls Baby’s Rattle That Could Choke 
Child 
October 06, 1989|From Associated Press 
WASHINGTON - The manufacturer of a baby’s rattle announced Thurs- 
day that it was voluntarily recalling the toy because it could present a 
choking hazard to small children. ... 


The phrase that I’m interested in here is a baby’ rattle in the body of the story. The 
phrase could hardly be replaced by the rattle of a baby but that is for a reason en- 
tirely irrelevant to the present enterprise. The NP a baby is non-referential here, 
akin in meaning to the first element of the compound nominal a baby rattle, so the 
tests in (25) and (26) do not apply, the proof of which is that a baby’s rattle cannot 
even be paraphrased as a rattle of a baby, because now there is reference to an in- 
definite baby. 

In most instances where both “a N1” and “a N1’s N2” are referential, paraphras- 
es of the form “the N2 ofa N1” sound just as good or better in the same context, with 
the same reference. In some cases, in fact, a genitive possessive with one indefinite 
article cannot comfortably be replaced by a prepositional possessive with two in- 
definite articles. The phrase a person's dominant eye in (33) is not contextually equiv- 
alent to “a dominant eye of a person” because each person can have at most one 
unique, dominant eye. Spreading the indefiniteness is this case would suggest the 
existence of several dominant eyes, which is something of a contradiction. 


(33) Determination of ocular dominance 
A person's dominant eye “is determined by subjective alignment of two 
objects presented at a stereodisparity far beyond Panum’s area’. 


In other cases, such as (34), either a definite or an indefinite paraphrase of the 
genitive possessive will do: 


(34) Joey Porter’s dogs get loose, kill neighbor's dog 
Four dogs owned by NFL star Joey Porter got loose early July 2 from his 
northwest Bakersfield residence and killed a neighbor's dog. ... 
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We don’t know and don’t much care how many dogs the neighbor has, since only 
one was killed. If the neighbor happened to own more than one dog or if more 
than one neighbor owned one or more dogs, and the writer wanted you to know 
that, “.. killed a neighbor’s dog” could be replaced with “.. killed a dog of a neigh- 
bor’, but in all those cases, the definite article could also be used. The fact that 
Mr. Porter’s dogs killed only one dog of one neighbor (one hopes) makes that dog 
specific enough to allow for the use of a definite article, regardless of the number 
of surviving dogs a neighbor's household. 

Another example of the same sort is provided by a Fox News story that aired on 
August 25, 2011. Martha MacCallum introduced the story on camera, saying: 


(35) ‘A fallen Navy Seal’s dog refuses to leave his master’s side, even in death?!° 


‘The story was picked up and repeated in several places, one of which provided the 
following additional textual version: 


(36) Dog Mourns the Death of a Fallen Navy Seal 
The dog of a fallen hero, Jon Tumilson, stays by his side during the entire 
funeral all the way to the end. This is very sad and moving. May God be 
with the family of Mr. Tumilson and let us never forget those who are 
making the ultimate sacrifice." 


So here, not only can a phrase of the form “a N1’s N2” be appropriately rephrased 
as “the N2 of a N1’, it actually has been, and this shows that the reference of a 
genitive possessive construction in which the possessor is marked with an indefi- 
nite article is not itself indefinite. Fairly generally, then, a phrase of the form “a 
N1’s N2” can be replaced with “a N2 of a N1” or “the N2 of a N1”. In other words, 


(37) The unarticulated referent of an English phrase of the form ‘a N1s N2’ is 
neutral as to definiteness. 


We have seen that phrases of the form (27a), “the N1’s N2” can often be felici- 
tously be replaced by phrases of the form (27b), but sometimes they can also be 
replaced by “a N2 of the N1’, as in the following example: 


(38) Suspect in Deaf Woman’s Murder in Court 
DEKALB COUNTY, Ga. (MyFOX ATLANTA) - 
A murder suspect made his first court appearance Wednesday, just days 
after a deaf woman was found dead in her Avondale Estates apartment. 
The woman's neighbor, 21-year-old Darrel Charles Smith was arrested and 
charged with the murder... 


10. http://foxnewsinsider.com/2011/08/25/fallen-navy-seals-dog-refuses-to-leave- 
master%E2%80%99s-side-at-funeral/ 


11. http://www.godvine.com/Dog-Mourns-the-Death-of-a-Fallen-Navy-Seal--644.html 
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Since it is not unusual for Americans to have more than one neighbor, the same 
reference as is made by “the woman’s neighbor” in this case could just as well have 
been made by “a neighbor of the woman”. Therefore, according to definitions (25) 
and (26): 


(39) ‘The unarticulated referent of an English phrase of the form 
‘the N1s N2’ is neutral as to definiteness. 


Along with (37), this implies the more general proposition, 


(40) The unarticulated referent of an English genitive possessive of the form 
‘Art N1’s N2’ is neutral as to definiteness. 


Conclusion 


In three disparate cases - Old Norse article-less NPs, English telegraphic head- 
lines, and English genitive possessives, we have found that an unarticulated refer- 
ent is neutral as to definiteness. I may add to this the case of NPs in languages that 
lack articles altogether. In such languages, I am convinced that a purely pragmatic 
notion of definiteness/indefiniteness exists. Speakers know whether a referential 
term is intended to corefer with a contextually determinable entity or not and even 
in the absence of articles, something about the form of an utterance might make it 
clear. In any case, we have to conclude that simple NPs in languages without arti- 
cles are neutral as to definiteness. 

It would be tempting to say that what we have observed is a simple case of 
function following form: In the absence of an overt indication of definiteness or 
indefiniteness, a referential expression is neutral in that regard. But a direct func- 
tional account such as this cannot be maintained. As stated in (8), article-free mass 
and plural nominals in English are indefinite. I may add that proper names in 
English regularly appear without articles in definite uses. Where I can say to you: 
“Joe was here,’ I could also have said something like, “The plumber was here.” But 
I could not say “A plumber was here,’ referring to our good friend, Joe. 

So as we might expect, form and function often correlate in the working of 
natural languages, but do not always do so. There is such a thing as grammar. 
‘That's good news for us grammarians. 
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Language acquisition 


Doing diachrony* 


David Lightfoot 


Georgetown University 


Working on syntax through time requires good synchronic analyses of the initial 
and later I-languages. However, seeking to explain a change enables historical 
linguists sometimes to link the new I-language with particular features of the 
external language to which the innovating children were exposed. This enriches 
understanding of some of the idealizations of synchronic syntax, particularly 
the relationship between particular Primary Linguistic Data and the grammars 
triggered. 


Explaining change through acquisition 


In doing diachronic syntax, one needs good hypotheses for the early stage and the 
late stage of the language under investigation, for example about the grammars of 
Old Norse (Faarlund 2004) and modern Norwegian (Faarlund 2003); one needs a 
good synchronic analysis at both ends of any changes to be considered. That means 
that one needs all the ideas marshaled by synchronic syntacticians. However, there 
is more to diachronic work: questions arise that do not typically impinge on syn- 
chronic work, different research strategies are called for and certainly there are 
different research traditions. Some of this work casts light on the idealizations 
used in synchronic work and synchronic syntacticians have things to learn from 
work in diachrony. 

Historical linguists from the very beginnings of the field in the early nine- 
teenth century have always had a sharp eye for explanations, asking “why” ques- 
tions as soon as they began discovering the misnamed “laws” of sound change. 
Grimms Law dealt with changes in the Proto-Indo-European consonantal stop 
system in early Germanic but after colleagues like Hermann Grassmann had begun 


* Jan Terje Faarlund has done important work on the syntax of modern and ancient languages 


and on the changes taking place over generations of speakers. I reflect here on the results of a few 
decades of work on generative approaches to change that he and I have been involved in. 
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explaining exceptions to the “law,” Jacob Grimm knew that philologists needed an 
explanation for why those changes had taken place and turned toward some bad 
psychology toward the end of his life (Grimm 1848, Lightfoot 2006). There was a 
great deal of discussion about the explanation of sound changes, usually based on 
ideas about a universal directionality to change. People generally agreed that there 
was a directionality to change but there was no agreement on what the directions 
were. For an account of how historical linguists have sought explanations over the 
last two hundred years, see Lightfoot (2011). 

Over the last decades an approach has developed that links the explanation of 
syntactic changes to ideas about language acquisition and the (synchronic) theory 
of grammar. Baker & McCarthy (1981) identified the “logical problem of language 
acquisition” as one of identifying the three elements of the triplet of (1). 


(1) Primary Linguistic Data (Universal Grammar > grammar) 


Children are exposed to PLD and, as a result, their initial state characterized as UG 
develops into a mature state characterized by a particular grammar. The solution 
to the logical problem lies in identifying the three items in such a way that links a 
particular set of PLD to a particular grammar given particular ideas about UG. 
Under that approach, there can only be one way to explain the emergence of a new 
grammar: a new grammar will emerge when children are exposed to new PLD 
such that the new PLD trigger the new grammar.' 

The PLD consist in structurally simple things (in fact Degree-0 complex, 
Lightfoot 1999) that children hear frequently, robust elements of their E-language. 
Work in diachrony, therefore, makes crucial use of the E-language/I-language dis- 
tinction and keys grammatical properties to particular elements of the available 
PLD in ways that one sees very rarely in work on synchronic syntax. Successful 
diachronic work distinguishes and links two kinds of changes: changes in PLD 
(part of E-language) and changes in mature I-languages. The phenomena are quite 
different, as we will see when we consider some reasonably well-understood 
changes below. Work on language change in this perspective, linking changes to 
language acquisition by children, has been conducted now for some decades and 
there have been some surprising results that lead us to re-think the relationship 
between PLD and particular I-languages. 


1. There are other approaches that ascribe to UG not only a means of linking PLD to gram- 
mars but also “biases” that prefer certain grammars to others independent of the PLD and there- 
fore predict how grammars change over time (van Gelderen 2011, Roberts & Roussou 2003). 
This revives 19th-century and late-20th-century ideas about universal directions to change and 
fails to explain why relevant changes took place at particular times and under particular circum- 
stances and why they did not take place in languages where they are not attested. 
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(Chomsky 1986 distinguished E/I-language: E-language is the amorphous 
mass of language out in the world, the things that people hear. It is not a system but 
it reflects the output of the I-language systems of many speakers under many dif- 
ferent conditions, modulated by the production mechanisms that yield actual ex- 
pressions. Internal, individual languages, on the other hand, are mental systems 
that have grown in the brains of individuals and characterize the linguistic range 
of each of those individuals; I-languages (“T” for internal and individual) are re- 
presented in individual brains and are biological entities.) 


Models of acquisition 


Work in synchronic syntax has rarely linked grammatical properties to particular 
triggering effects, in part because practitioners often resort to a model of language 
acquisition that is flawed and is strikingly unilluminating for work on change. I 
refer to a model that sees children as evaluating grammars against sets of sen- 
tences and structures, matching input and evaluating grammars in terms of their 
overall success in generating the input data most economically, e.g. Clark & 
Roberts (1993), Gibson & Wexler (1994) and many others. A fundamental prob- 
lem with this approach is that I-languages generate an infinite number of structures, 
and, if children are seen as setting binary parameters, they must, on conservative 
assumptions that there are only 30 or 40 parameters, entertain and evaluate bil- 
lions of grammars, each capable of generating an infinite number of structures. 
For discussion, see Lightfoot (1999). Beyond these overwhelming issues of feasi- 
bility, this approach raises further problems for thinking about change, because 
work usually fails to distinguish E-language and I-language changes and encoun- 
ters problems of circularity: the new grammar is most successful in generating the 
structures of the new system but that presupposes that the new structures are al- 
ready available in order to explain the emergence of the new grammar. This is part 
of a larger problem: if one asks a syntactician how children can learn some gram- 
matical property, she will point to sentences that are generated in part through the 
effects of the relevant grammatical property, taking those sentences to be the nec- 
essary PLD. This circularity will become clear when we discuss specific changes. 
A cue-based approach (Dresher 1999, Fodor 1998, Lightfoot 1999) is not sub- 
ject to the feasibility problems of grammar evaluation and treats children as look- 
ing for certain structures expressed by the PLD, the cues, i.e. the structures needed 
in order to parse the sentence. There may be a thousand or more possible cues, 
provided by UG, but that does not present the feasibility problems of evaluating 
the success of thirty or forty binary parameters against corpora. Once children 
have an appropriate set of cues, the resulting I-language generates what it generates 
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and the overall, gross set of structures plays no role in triggering or selecting the 
grammar. This model of acquisition keys elements of grammar to particular ele- 
ments of the PLD and provides good explanations for diachronic shifts and the 
emergence of new grammars. The model of acquisition allows us to link a cue with 
the PLD that express that cue and that yields some surprising results that force us 
to think about triggering experiences differently. 

So a person’ internal language capacity is a complex system that depends on 
an interaction between learned operations and principles that are conveyed by the 
genetic material, directly and indirectly. It grows in children in response to the E- 
language that they encounter, the source of the cues, and becomes part of their 
biology. If language growth in young children is viewed in this way, then we can 
explain language change over generations of speakers in terms of the dynamics of 
these complex systems. In particular, we can explain how languages shift in bursts, 
in a kind of punctuated equilibrium, and we can explain the changes without in- 
voking principles of history. 


First case study 


Consider two structural shifts that English has undergone, phase transitions that 
are now well understood. 
Modern English has forms like (2-6a) but not (2-6b). 


(2) He has understood Chapter 4 


a 
b. *He has could understand Chapter 4 


(3) Understanding Chapter 4, ... 


a 
b. *Canning understand Chapter 4, ... 


(4) a. He wanted to understand 
b. *He wanted to can understand 
(5) a. He will try to understand 


b. *He will can understand 


(6) a. Heunderstands music 
b. *He can music 


However, earlier forms of English had the b forms, which were used by speakers 
up to Sir Thomas More in the early sixteenth century. More used all the forms of 
(2-6) and the b forms occur in nobody’s writing after him. (7-9) provide some 
examples of the latest occurrences of the obsolescent forms, (7) corresponding to 
(5b), (8) to (4b), and (9) to (2b). 
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(7) I fear that the emperor will depart thence, before my letters shall may 
come unto your grace’s hands (1532, Cranmer, Letters) 

(8) That appered at the fyrste to mow stande the realm in grete stede 
(1533, More, Works 885 C1), ‘appeared at first to be able to stand the realm 
in good stead’ 

(9) Ifwee had mought convenient come togyther, ye woulde rather haue cho- 
sin to haue harde my minde of mine owne mouthe 
(1528, More, Works 107 H6), ‘if we had been able to come together conve- 
niently, ...? 


‘There is good reason to believe that there was a single change in people’s internal 
systems such that words like can, could, must, may, might, will, would, shall, should 
and do were once categorized as more or less normal verbs but then they were re- 
categorized as Inflectional elements in all known grammars of English speakers 
after the time of More. Before More, verbs like can moved to a higher Inflection 
position, as in (10), and after More they were generated directly as Inflectional ele- 
ments and occurred in structures like (11), a single shift in the system, which was 
manifested by the simultaneous loss of the phenomena in (2-6b), the phase transi- 
tion; sentences like (2-6b) are not compatible with a system with structures 
like (11). The stncuLariry of the change explains the PARALLELISM in the loss of 
phenomena. 


(10) 
ca 
Spec 
gk, 
I 
= Js 
Vv 
can see “ue Ss 
(11) IP 
BOS 
Spec IP 
a 
can a 
VP 
ne stars 


This change occurred only in Early Modern English and nothing comparable hap- 
pened in any other European language, so it is not satisfactory to say that this 
change is to be explained by a general tendency to grammaticalize or to re-categorize 
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modal verbs as members of a functional category. If there is a general tendency, 
why has it not yet affected any other European language? In particular, some have 
revived historicist claims and argued for “UG biases” as an internal driver to ex- 
plain grammaticalization phenomena (van Gelderen 2011, Roberts & Roussou 
2003). This enables them to unify some phenomena, which provides a level of ex- 
planation. The change of category membership for the English modals is a parade 
case of grammaticalization but saying that it results from an internal drive or a UG 
bias gives no explanation for why it happened when it did nor under what 
circumstances and does not explain why the change hasn’t happened in any other 
European language. 

A critical property of this change is that it consisted entirely in the Loss of 
certain phenomena (2-6b) and there were no new forms emerging. Since children 
converge on their I-language in response to ambient simple expressions and not in 
response to “negative data” about what does not occur, the new, more limited data 
need to be explained by a new abstract system that fails to generate (2-6b). There 
were no new forms in which the modal auxiliaries began to occur and the trigger 
for the new system must lie elsewhere. In this case, the new PLD cannot be the 
new output of the new grammars, because there are no new forms. Changes like 
this, which consist only in the loss of expressions, make up a kind of poverty-of- 
stimulus argument for diachrony: there appear to be no new forms in the PLD that 
directly trigger the loss of those expressions. 

If we ask why this or any other I-language change happened, there can only be 
one answer under this approach: children came to have different primary linguis- 
tic data (PLD) as a result of a prior change in external language. We have a good 
hypothesis about what the prior E-language change was in this case. 

Early English had complex morphological properties. For example, we find 
fremme, fremst, fremp, fremmap in the present tense and fremed, fremedest, fre- 
mede, fremedon in the past tense of ‘do;’ séo, siehst, siehp, seop in the present tense 
for ‘see;’ ride, ritst, ritt, ridap for the present tense of ‘ride; and rad, ride, rad and 
ridon for the past tense. There was a massive loss of verbal morphology in Middle 
English, beginning in the north of England and due to intimate contact with 
Scandinavian speakers (O’Neil 1978). Again I skip interesting details but external 
language that children heard changed such that the modern modal auxiliaries like 
can, shall, etc. came to be morphologically distinct from other verbs, because as 
the members of the small preterite-present class, they lacked the one surviving 
feature of present tense verb morphology, the -s ending of the third person singu- 
lar. Furthermore, their “past tense” forms (could, would, might, etc.) had meanings 
that were not past time, reflecting old subjunctive uses (12). The evidence indi- 
cates that these modal verbs were recategorized in people's internal systems, be- 
cause they had become formally distinct from other verbs as a result of the radical 
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simplification of morphology. So we see domino effects: changes in what children 
heard, the newly reduced verb morphology, led to a different categorization of 
certain verbs, which yielded systems (11) that were compatible with (2—-6a) but not 
(2-6b). 


(12) They might/could/would leave tomorrow. 


More was the last known speaker with the old system. For a period, both systems 
co-existed: some speakers had (10) and others had (11), the former becoming 
rarer over time, the latter more numerous. A large literature is now devoted to this 
kind of sociological variation, changing over time, and we will return to this mat- 
ter in a moment. 


Second case study 


A later major change was that English lost (13-15a), another phase transition. 
Such forms occurred frequently in texts up through the seventeenth century, al- 
though diminishing over a long period in favor of the do forms of (13-15b). 


(13) *Understands Kim Chapter 4? 


a 
b. Does Kim understand Chapter 4? 


(14) a. *Kim understands not Chapter 4 
b. Kim does not understand Chapter 4 


(15) a. *Kim reads always the newspapers 
b. Kim always reads newspapers 


Again we can understand the parallelism of the three changes in terms of a single 
change in the abstract system, namely the loss of the operation moving verbs to a 
higher Inflection position (16). This is another change peculiar to English and not 
affecting any other European languages (apart from some Scandinavian dialects), 
whose systems have retained the verb movement operation. In present-day 
English verbs do not move to the higher position and therefore cannot move to 
clause-initial position (13a), to the left of a negative (14a), or to the left of an ad- 
verb (15a). The equivalent movements continue to occur in French, Italian, Dutch 
and German systems. Again a contingent explanation is required: what was it 
about English at this time that led to this shift in I-languages? In particular, what 
were the new PLD that children were exposed to that helped to trigger the new 
grammar? 
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(16) IP 
Spec IP 
i aa 
see stars 


It is reasonable to think that this shift was due to two prior changes and we see 
another domino effect. The first was the new I-language that we just discussed, 
involving the recategorization of modal verbs, and the second was the emergence, 
first in the Westcountry, of “periphrastic” do forms as an alternative option for 
expressing past tense: John did leave, John did not leave, etc., instead of John left and 
John left not (McWhorter 2009 offers an interesting analysis, attributing the new 
do forms to Cornish influence). As a result of these changes, a new I-language in- 
fluencing what children heard in external language and new, independent changes 
in E-language involving the encroaching do, the Inflection position came to be 
occupied by modal auxiliaries and by do in internal systems and was not available 
as a target for verb movement in those instances. Thus, lexical verbs did not occur 
in that position as often as before the days of periphrastic do and before modal 
auxiliaries were no longer verbs, and as a result, the ,[V] structure fell below the 
threshold that had permitted its acquisition by children. 

As with the new modal items, the two systems co-existed for a while, in fact 
for a longer period than with the recategorization of the modal verbs: Shakespeare 
and other writers alternated easily between the coexisting old and new systems, 
sometime using the old V-to-I forms and sometimes the new do forms, even with- 
in the same sentence (17). 


(17) a. Where didst thou see her? - O unhappy girl! - With the Moor, say’st 
thou? 
b. I like not that. // What dost thou say? 
c. Alas, what does this gentleman conceive? - How do you, madam? 


Again this is too brief an account but I hope to have made clear that two prior 
changes in external language, some due to a shift in I-languages, had the effect of 
reducing enormously children’s evidence for the ,[V] structure, triggering a new 
internal system, and that three simultaneous but apparently unrelated changes 
were a function of a single change in the abstract system.? 


2. Many people have contributed to our current understanding of these two phase transitions. 
Roberts (2007) gives a detailed account of both changes in I-languages. 
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Variation 


Amorphous external language and internal systems are different in kind and the 
modern distinction between external and internal language is crucial; both play a 
role in explaining change (Lightfoot 2006). In addition, there is interaction be- 
tween nature and nurture, and also within the system, syntax interacting with the 
articulatory and semantic systems, providing pairings between sound and mean- 
ing over an infinite range. 

The language capacity is a complex adaptive system. Children are exposed to 
speech and their biological endowment, a kind of toolbox, enables them to inter- 
act with their external linguistic experience, their E-language, thereby growing a 
private, internal system that defines their linguistic capacity. 

Since the systems are complex and adaptive, they involve particular abstract- 
ions, categories and operations and these, not the behaviors themselves, constitute 
the real points of variation and change. Phenomena do not change in isolation but 
they cluster, depending on the abstract categories involved. As a result, change is 
bumpy and takes place in a kind of punctuated equilibrium. We explain the bumps, 
the clusters of changes, in terms of changes in the abstract system, as we illustrated 
in the two phase transitions in the history of English. If we get the abstractions 
right, we explain why phenomena cluster in the way they do. 

Everybody’s experience varies and people's internal systems may vary, but not 
linearly. Also I-languages change over time, and sometimes variation in experi- 
ence crosses thresholds and triggers the development of a different internal sys- 
tem. Children are sensitive to variation in initial conditions, in the terminology of 
chaos theory. In general, we understand change in internal systems through the 
acquisition process, by virtue of children experiencing different E-language. We 
explain changes where we can identify changes in the external language that chil- 
dren are exposed to such that the new experiences trigger different internal systems 
with different categories and operations. For example, after the comprehensive 
morphological changes of Middle English, young children had different experi- 
ences that led them to categorize words like may and must differently from verbs 
like run and talk. Assigning these words to a different category, Inflection, explains 
why (2-6b) all disappeared in parallel. Similarly new forms resulting from modal 
verbs being treated as Inflectional items and new forms with periphrastic do en- 
tailed that the ,[V] cue was expressed less robustly and fell out of use, entailing the 
obsolescence of (13-15b). 

Under this approach, change is contingent, dependent on particular circum- 
stances, and we explain why English underwent at this time two changes that oth- 
er European languages have not undergone. English had particular morphological 
properties that were affected in particular ways by contact with Scandinavian 
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speakers and that led to the new categorization. Other European languages were 
not affected in that way. If change is contingent like this, then there is no general 
direction to change and there is no reason to believe that languages all tend to 
become more efficient, less complex, or less anything else. There are no general 
principles of history of the kind that nineteenth-century thinkers sought and ex- 
planations are local (Lightfoot 2011). There is no reason to revive historicism or 
invoke principles of history. 

This approach to syntactic change also provides a new understanding of syn- 
chronic variation, along the lines of Labov’s discussion of sound change in progress 
(e.g. Labov 1972). When a phase transition (a catastrophe, a rapid, structural shift) 
takes place, it does not happen on one day. Rather, a new I-language emerges in 
some children and spreads through the population, taking over from the old I-lan- 
guage, sometimes over the course of a century or more (but typically not for a very 
long period). Co-existing grammars (Kroch 1989, 1994) explain the nature of cer- 
tain variation within a speech community: in this context, one does not find random 
variation in the texts but oscillation between two (or more) fixed points. In general, 
writers either have all the forms of the obsolescent I-language or none. Not all varia- 
tion is to be explained in this way, of course. There is also variation in E-language 
that has little if anything to do with I-language. In amorphous E-language variation 
is endemic: no two people experience the same E-language and, in particular, no 
two children experience the same primary linguistic data. Since E-language varies 
so much, there are always possibilities for new I-languages to be triggered. 

At a synchronic level, competing grammars also provide a new understanding 
of apparent optionality of computational operations. If operations may be optional 
or obligatory, that presents severe learnability problems, because the need for oblig- 
atory operations is usually based on precluding what does not occur. That consti- 
tutes “negative data,’ which are not available to young children. Positing competing 
grammars enables the elimination of an optional/obligatory distinction: rather, 
there are competing grammars, one allowing an operation, the other not. 

A recent development that has revolutionized work in diachronic syntax has 
been the development of partially parsed corpora of historical prose texts. This 
began with corpora of texts from various stages of English emanating from the 
University of Pennsylvania in collaboration with faculty at other universities in 
other countries. More recently corpora have been developed for other languages, 
including Portuguese, French, Icelandic, Hungarian and Early High German. I 
provide a list of the existing corpora at the end of the paper. Researchers can search 
for structures over vast amounts of data. 

‘These corpora open the possibility of searching for coexisting structures and 
determining whether the coexisting structures cluster in such a way as to manifest 
competing I-languages, oscillation between two or more fixed points, or whether 
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the variation manifests the randomness of variable E-language. There are different 
patterns of variation and this, in turn, illuminates the distinction between new 
I-languages and new E-language that might trigger a new I-language. That en- 
riches our hypotheses about language acquisition, providing data that bear on the 
distinction between the causes and effects of syntactic change. 


Identifying triggers 


So the major contributions of diachronic work in syntax lie in explaining one kind 
of variation, due to coexisting I-languages, and in revealing what the E-language 
trigger might be for any particular property of I-languages. 

It is surprising how little discussion there has been of what triggers what prop- 
erties, given the explanatory schema of (1). 

Consider the Binding Theory, for example, which postulates that children 
learn which nouns are anaphors, pronouns and names (Chomsky 1986, etc.). 
Given the Binding Theory, once a child has learned that themselves is an anaphor, 
her a pronoun, etc., all the appropriate indexing relations follow, with no further 
learning. Then one must ask how this is learned. Exposure to a simple sentence 
like (18a), interpreted with themselves referring to they (coindexed), suffices to 
show that themselves is an anaphor and not a pronoun or name; pronouns and 
names may not be coindexed with c-commanding element within its Domain. 


(18) a. They, washed themselves.. 
b. Kim,’s father loves her,. 
c. Kim, heard DP [Bill's speeches about her,]. 
d. Kim left. 


(18b), interpreted with her referring to Kim, shows that her is no anaphor and (18c), 
with her referring to Kim, shows that her is not a name; the Domain of her is the DP 
indicated and her is free within that DP. If neither an anaphor nor a name, then her 
is a pronoun. A very simple expression like (18d) shows that Kim is not an anaphor, 
but there is no positive evidence available to a child showing that Kim is not a pro- 
noun. Analysts know that Kim is not a pronoun, because one does not find sen- 
tences of the form Kim said that Kim left, with the two Kims referring to the same 
person, but that is a negative fact concerning something which does not occur, 
hence unavailable to young children. That suggests a hierarchical organization. 
One approach is to say that children initially take every word as a name, unless 
there is positive, refuting evidence. Under that view, sentences like (18a) show that 
themselves is an anaphor, and not a pronoun nor a name. And (18c) shows that her 
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is not a name, because it is coindexed with the DP Kim, and not an anaphor, be- 
cause it is not locally coindexed. This yields a satisfactory account. We have a the- 
ory of mature capacity that provides the appropriate distinctions and one can show 
how children learn from environmental data which elements are anaphors and 
which are pronouns; everything else is a name and no other learning is relevant. 

My point here is that postulating the Binding Theory is not enough and one 
needs to consider the learning involved and that often involves substantive issues. 

We should allow for the possibility that the PLD that trigger a grammatical 
property may not have any obvious connection with that property. Indeed, our 
discussion of the recategorization of modal verbs was triggered by new morpho- 
logical properties. 

Niko Tinbergen (1957: ch. 22) once surprised the world of ethologists by 
showing that young herring gulls’ behavior of pecking at their mothers’ mouths 
was triggered not by the fact that the mother gull might be carrying food but by a 
red spot. Mothers typically have a red spot under their beak and Tinbergen de- 
vised an ingenious series of experiments showing that this was the crucial trigger- 
ing property. So the chicks would respond to a disembodied red spot and not to a 
bird carrying food but lacking the strategic red spot. Similarly, grammars may have 
certain mechanisms and devices because of properties in the primary data that are 
not obviously related to those mechanisms, as we saw in our first case study. 

People have their own internal system, a grammar, which develops in them in 
the first few years of life as a result of an interaction between genetic factors com- 
mon to the species and environmental variation in primary linguistic data. Such a 
grammar represents the person's linguistic range, the kind of things that the per- 
son might say and how he/she may say them. If they hear different things, children 
may converge on a different system, a new grammar, perhaps the first instance of 
a particular, new I-language. We want to find out what triggers which aspect of a 
person's I-language, therefore how new I-languages might emerge. 

Many people have claimed that we can discover things about grammars by 
seeing how they change. Anderson (1988) put it graphically: ‘Like an animal stand- 
ing against the background of a forest, the outlines of a grammar can be thrown 
into sudden relief when something changes’ In particular, by looking at change 
one can generate productive hypotheses about what PLD trigger particular prop- 
erties of I-languages. 


Postscript: Limits 


We may have achieved ideal explanations for certain syntactic changes in terms 
of how children acquire their I-language but it is not clear that this mode of 
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explanation extends to systematic phonological shifts like those of Grimm or 
Grassmann’s Laws. It is unclear how new PLD could have triggered the systematic 
changes in Germanic consonants and it may be that other forces are at work in 
phonology, where poverty of stimulus reasoning has not played a major role.’ Per- 
haps there are internal forces that drive sound change. For example, many changes 
can be understood in terms of an internal drive to simplify articulation: Latin octo 
has become Italian otto, Swedish drikka and takka have eliminated the nasal of 
‘drink and ‘thank; and English family has become famly. Listeners sometimes in- 
terpret what they hear differently from what was intended and change their pro- 
nunciation accordingly, reanalyzing the PLD. Often this is influenced by matters of 
group identity, people adopting new pronunciations as a way of identifying them- 
selves with a group; such factors have not been shown to play a role in syntactic 
change. Internal drivers may play a role in phonological change but, in general, 
they are more limited in their explanatory power than the contingent, acquisition- 
based approach taken here and offer no grammatically based explanation for why 
the changes take place. 
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The acquisition of linguistic variation 


Parameters vs. micro-cues 


Marit Westergaard 
CASTL, University of Tromso 


Against the backdrop of some traditional generative work on parameters as well 
as some recent constructivist work, this paper discusses how children handle 
variation that they are exposed to in the input. Investigating different types of 
word order variation, the paper finds no evidence for parameters in language 
acquisition data and instead argues for a model of micro-cues, according to 
which children are sensitive to fine distinctions in syntax and information 
structure from early on. Furthermore, the generalizations that are made in the 
acquisition process are argued to apply only to the relevant class or subcategory. 
This accounts for the lack of (over-)generalization in child data, often referred to 
as conservative learning. 


1. Introduction 


Word order variation in traditional generative work has often focused on variation 
across languages, typically accounted for by different settings of parameters given 
by Universal Grammar (UG). This paper discusses various types of word order 
variation that is found within languages and thus in the input that children are 
exposed to in the acquisition process. Some of this variation represents exceptions 
to what has been argued to be a parameter (OV/VO or verb second (V2)), while 
there is other language-internal variation that has not been argued to be part of a 
parameter (e.g. different subject positions or word order in possessive construc- 
tions). In all cases it is shown that children produce both word orders from the 
earliest attestable stages, making the same fine distinctions in syntax and informa- 
tion structure (IS) as in the target language. The paper thus makes the following 
main argument: Since all this complex variation has to be learned from the input, 
and children are clearly quite good at it from early on, there should be no need for 
UG parameters to explain the acquisition of basic word order facts. Instead, the 
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paper presents a model of micro-cues, arguing that children do not focus on gen- 
eral rules first, but build up their grammar from small pieces of structure, extend- 
ing it in a step-wise fashion. 

‘The paper is organized as follows: The next section gives an overview of some 
traditional generative work on parameters, as well as a brief discussion of work 
within the constructivist camp, focusing on null subjects and head-complement 
word order respectively. Section 3 discusses some cases of word order variation in 
the input, mainly from my own work on English and Norwegian, and provides an 
overview of acquisition data that show that children typically have no problems 
acquiring fine distinctions in syntax and information structure that cannot be part 
of a parameter. In Section 4, I present the model of micro-cues, focusing on con- 
servative learning, and Section 5 is a brief summary. 


2. Background 


2.1 Traditional generative work 


Within much traditional generative work, it has been a common assumption that, 
in order to be learnable, human grammars must consist of (invariant) universal 
principles as well as a number of parameters which account for the variation across 
languages. This model of grammar, normally referred to as the Principles and Pa- 
rameters framework, is argued to account for the ease and speed of language ac- 
quisition, as expressed in the following quote from Pinker (1994: 112): 


All [children] have to learn is whether their particular language has the param- 
eter value head-first, as in English, or head-last, as in Japanese. They can do that 
merely by noticing whether a verb comes before or after its object ... Huge chunks 
of grammar are then available to the child, all at once, as if the child were merely 
flipping a switch to one of two possible positions. [emphasis MW] 


The idea that UG consists of such two-way parameter switches has been immense- 
ly popular and therefore persistent within syntactic theory. The attractiveness of 
this framework was formulated by Valian already in 1990 in the following way 
(Valian 1990: 105): 


Parameter-setting theories of language acquisition are attractive because they 
directly address a deep and basic puzzle of acquisition: children learn language 
rapidly - any language - yet languages seem to vary enormously ... As a theory of 
language, parameter-setting regularizes and systematizes the variety of languages. 
[emphasis MW] 
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Within the Principles & Parameters approach to language, children’s mistakes 
were often considered to be the result of parameter missetting. One of the earliest 
and perhaps most quoted example of this is found in Hyams (1986), where it is 
argued that all children start out with a pro-drop grammar, i.e. with the null sub- 
ject parameter set for [+pro-drop], as in Italian. This would account for the ap- 
pearance of null subjects in early child language, which is frequently found also in 
non-null-subject languages, as illustrated in (1) from English (examples from 
CHILDES, MacWhinney 2000, Brown corpus, Brown 1973) and in (2) from 
Norwegian (examples from the Troms@ corpus, Anderssen 2006). 


(1) want some grape juice. (Eve.03, age 1;7) 


(2) tegne ikke (s)tolen. (Ann.03, age 1;10.2) 
draw.PRESs not chair.DEF 
‘(?'m) not drawing the chair? 


However, it was soon pointed out by Valian (1990, 1991) that children acquiring 
English are very different from e.g. Italian children with respect to the proportion 
of null subjects in spontaneous speech. She shows that, while American children 
(average age 2;0) produce 69% overt subjects, Italian children of a corresponding 
age produce only 30%. English-speaking children also produce many more pro- 
nominal subjects than Italian children. This means that English-speaking children 
are both quantitatively and qualitatively different from Italian children with re- 
spect to the production of null subjects already at a very early stage, indicating that 
they “know” that they are acquiring a non-null-subject language. 

Valian (1990) also argues that the idea of parameter missetting is problematic 
in various ways, especially with respect to the question of the initial setting. Focus- 
ing on the pro-drop parameter, she shows that assuming either setting as the initial 
one would lead to problems. One of the reasons for this is that, even in non-null- 
subject languages, there are occasional examples in the input where the subject is 
dropped. Thus, if the null subject parameter is initially set for [+pro-drop], chil- 
dren learning a non-null-subject language such as English, German or Norwegian 
would have positive evidence for both values and, due to this conflicting evidence, 
they would never be able to reset the original value of the parameter. This would 
also apply if one assumed that there is no initial setting. Initially setting the null 
subject parameter for [—pro-drop] would not fare much better. In that case, chil- 
dren learning a pro-drop language would start out with an English-type grammar. 
However, Italian-like input would then simply be interpreted as ungrammatical 
(or as bare VPs), and this would not change the parameter to a [+pro-drop] setting 
for these children. If it did, English children would be expected to change their 
grammar when hearing occasional subjectless sentences, which clearly they do not 
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do. An initial [-pro-drop] setting of the parameter would thus mean that children 
learning a pro-drop language would never be able to set the parameter correctly. 
Furthermore, this goes against common findings in child language data, cf. ex- 
amples (1)-(2), suggesting that development is the other way around. 

In this connection, Valian considers whether indirect negative evidence may 
be useful in these situations. Referring to Lasnik (1983), she speculates that, if the 
initial setting is [+pro-drop], the child should somehow ‘expect’ to hear null sub- 
jects. If that does not happen, the parameter may then be reset for [-pro-drop]. 
However, she discusses the problematic issue of negative evidence in relation to 
parameter setting in the following way (Valian (1990: 114)): 


Indirect negative evidence is an example of the kind of reasoning that parameter- 
setting was intended to eliminate as a feature of language acquisition. Pure 
parameter-setting as a model of acquisition cannot legitimately incorporate a 
notion like indirect negative evidence, because it requires the child to evaluate 
and weigh evidence. The reasoning of indirect negative evidence is the reason- 
ing of hypothesis-testing and theory construction, not the automatic triggering 
of parameter-setting. 


Valian then discusses the idea of both values being available simultaneously, and 
ends up endorsing this proposal. However, this entails “beefing up” the learning 
mechanism, meaning that children must be able to detect minor distinctions in 
the input. For the pro-drop parameter this means e.g. paying attention to the dis- 
tribution of null subjects and distinguishing between main and embedded clauses 
(null subjects being restricted to initial position and main clauses in English). She 
concludes that ‘a wealth of data suggest that children are adept at distributional 
analysis’ (Valian 1990: 118). This means that early generative work recognized the 
need to modify the idea of parameter setting as an automatic process in language 
acquisition. 

But this did not lead to a rejection of parameters within mainstream genera- 
tive theory. Instead, Wexler (1999) introduced the principle of Very Early Param- 
eter Setting (VEPS), arguing that parameters are set even before children start 
producing relevant utterances, and this is the reason why there are no traces of this 
process in children’s production. Non-target-consistent production therefore can- 
not be due to parameter missetting, and Wexler argues instead that children’s 
errors are due to certain constraints that have not matured yet in children’s gram- 
mars, e.g. the Unique Checking Constraint, which is argued to be responsible for 
the production of optional infinitives in various child languages. 

Later research on subjectless sentences in various languages has shown that 
matters are even more complex than what Hyams (1986), Valian (1990, 1991) and 
others assumed, in that the conditions on null subjects may be very different from 
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what they are in Italian. In Russian, for example, null subjects are related to IS, in 
the sense that subjects may be dropped if they convey given information in the 
discourse. However, Russian differs form Italian in that overt subjects do not carry 
any additional emphasis. Nevertheless, Gordishevsky & Avrutin (2004) show that 
children acquiring Russian have no trouble figuring out under what conditions 
null subjects are to be used. Thus, they argue that IS is also part of early linguistic 
knowledge. Some overuse of empty elements in early child data is explained as a 
result of children’s lack of the Concept of Non-shared Knowledge (Schaeffer 2000), 
which means that they treat some subjects that are new to their interlocutors as 
given information. Comparing child English, Italian and Russian, Zdorenko 
(2005) also shows that there is some delay in the production of overt subjects in all 
three languages, but that children at the same time have already zoomed in on the 
constraints of the target grammar. 

In a recent paper, Smith & Law (2009) make a three-way distinction between 
what they refer to as Universal Principles, Parameters, and Accidents. According 
to their argumentation, parametric choices must satisfy the following seven crite- 
ria: they must be cognitively represented, systematic, dependent on the input, de- 
terministic, discrete, mutually exclusive, and irreversible. Thus, they claim that a 
lot of variation previously argued to be due to a parameter is really due to acci- 
dents. Examples of ‘real’ parameters, according to their definition, are still the 
“good old ones’, i.e. pro-drop and certain word order choices across languages. In 
my view, if there is any reality behind this distinction, we would expect there to be 
a major difference in the acquisition of parameters on the one hand and accidents 
on the other, in that acquisition of the true parameters should be quick and auto- 
matic and accidents more problematic. As I will show in Section 3, there is no evi- 
dence for this in Norwegian and English child data on word order. 


2.2 Constructivist work 


‘The constructivist program has in some ways been created as an anti-generative 
enterprise, with a strong skepticism towards parameters (Tomasello 2005). In this 
section I briefly review a classic constructivist study on the acquisition of word or- 
der, viz. Akhtar (1999). The claim in much of the constructivist literature is that 
children at an early stage do not have a general understanding of word order, e.g. 
SVO or SOV that supposedly results from setting the head-parameter. Instead, it is 
argued that children rely on verb-specific formulas, e.g. [NPpusher-form of the verb 
PUSH-NPpushee]. Learning word order is not the result of setting a parameter for 
e.g. VO or OV, but is the result of a data-driven gradual process, where word order 
is acquired in an item-based fashion verb by verb, eventually involving generalization 
from a high number of examples. Akhtar (1999) carried out an experiment with 
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English-speaking children in three age groups (two-, three- and four-year-olds), us- 
ing novel verbs in three different word orders, as illustrated in (3). 


(3) SVO, e.g. Elmo dacking the car 
SOV e.g. Elmo the car gopping 
VSO e.g. Tamming Elmo the car 


The children were exposed to the novel verbs in only one of these word orders, and 
were then asked to produce sentences with the relevant verbs. The prediction made 
by parameter-setting models of language acquisition would be that young children 
would not use the ungrammatical SOV and VSO orders, because they have already 
set the head-direction parameter to the value for English (SVO). Constructivist 
models, on the other hand, would expect children to repeat the word order that 
they had heard with the different novel verbs, because they acquire word order on 
a verb-by-verb basis. 

The results showed that the younger age groups were equally likely to produce 
non-English orders spontaneously as to use SVO, whereas the oldest children con- 
sistently corrected to SVO. In the control condition, on the other hand, where the 
children were asked to produce similar sentences with familiar verbs, also the 
youngest children corrected the ungrammatical word orders to SVO. The explana- 
tion offered for this is that the children did not accept the non-English word order 
with verbs for which they had already constructed a schema. 

Results such as these are difficult to explain in generative models. One may be 
tempted to refer to factors such as experimental noise, as well as problems in- 
volved in exposing children to ungrammatical input. Nevertheless, there is an in- 
creasing number of studies of this kind in the constructivist literature, and in my 
view, they can no longer be ignored in the generative camp. The micro-cue model 
presented in this paper is an attempt to make sense of results such as these within 
a generative framework. 


3. Variation in the input 


As discussed in the previous section, parameters were originally argued to take 
care of the variation across different languages; that is, to explain why some lan- 
guages have VO word order while others have OV, or why only some languages 
allow null subjects while others do not. But there is of course a lot more variation 
in languages than what these major parameters indicate. This becomes especially 
obvious when one studies several dialects of a language or diachronic texts. Fur- 
thermore, there is considerable variation within languages. This means that chil- 
dren are often exposed to input that is compatible with both settings of a given 
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parameter, and it is therefore an important question how they set parameters to 
the right values in such situations. Do they set a parameter based on exposure to a 
few examples (as originally assumed) and then, in case they were wrong, reset it? 
And do they overgeneralize this setting at an early stage and then learn all the ex- 
ceptions later? 

In the following sections I consider two types of variation; in 3.1 I discuss in- 
put that constitutes exceptions to alleged parameters and in 3.2 variation that has 
typically not been argued to be part of a parameter. In both cases I also provide 
data on how children deal with this variation. 


3.1 Variation in parameters 


3.1.1 Variable VO/OV 

‘The perhaps most central parameter argued for in the literature is the head param- 
eter, among other things responsible for whether a language has the object preced- 
ing or following the verb. Most present-day standard European languages have 
either one or the other of these word orders. However, considering data from his- 
torical languages, e.g. Old and Middle English (Pintzuk 1991, 2005, Roberts 1997) 
or Old Icelandic (Hrdéarsdéttir 2004), one finds that both word orders are attested, 
as exemplified in (4) and (5) from Old English. Also present-day languages ex- 
hibit this variation, a prime example being Russian, as discussed in Dyakonova 
(2003, 2004, 2009), illustrated in (6) and (7). 


(4) Wene magan eow neadian (Old English) 
we NEG can you constrain 
“We cannot constrain you...’ (Pintzuk 2005: 129) 
(5) Se wolde gelytlian pone lyfigendan helend 
he would diminish the _ living saviour 
‘He would diminish the living saviour ..’ (Pintzuk 2005: 117) 
(6) Moi sosed odolzhil mne slovarj. (Russian) 
my neighbour lent me dictionary 


‘My neighbour lent me a dictionary: 


(7) Moi sosed mune slovarj odolzhil. 
my neighbour me dictionary lent 
‘It’s the dictionary that my neighbour lent me: (Dyakonova 2004: 92) 


The word order variation in Old English or present-day Russian is not completely 
free, but has been found to be dependent on fine syntactic distinctions (e.g. wheth- 
er the object is negated or quantified) as well as specificity or information struc- 
ture (whether the object is given or new and/or focused information). In Russian, 
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topical objects (including most pronouns) must scramble to preverbal position in 
order not to be interpreted as part of the Information Focus. 

What do children who are exposed to this word order variation do? If children 
were initially setting a parameter for either head-first or head-last, we would ex- 
pect to find only one word order at an early stage and overgeneralization in those 
cases where the other word order is required. Obviously, there are no data from 
children acquiring historical languages, but data from a Russian child are attested 
in Dyakonova (2003, 2004). She finds that both OV and VO word orders are pro- 
duced in appropriate contexts from early on, illustrated in (8) with a specific object 
and (9) with a non-specific object. 


(8) a. Ja moju rech zapisyvaju. (Varvara 1;10) 
ILNOM my _ speech.acc record 
‘Tam recording my speech’ 


b. Hochu vot eto pochitat. 
want.lsG this one.acc read 
‘T want to read this one? 


(9) Kubik est kosti. (Varvara 2;0) 
Kubik eats bones.acc 
“Kubik eats bones: (from Dyakonova 2003: 60) 


This finding indicates that, if there is early parameter setting (presumably for VO, 
which is considered to be the basic word order), then there is equally early acquisi- 
tion of the exceptions to this parameter (OV with specific or informationally given 
objects). But there is no evidence that the child at any stage overuses the basic 
word order. The Russian child is also found to distinguish between DP and pro- 
nominal objects (placing the latter almost exclusively in preverbal position, 
cf. (8b)), showing that she is also able to make fine categorial distinctions in the 
acquisition process. The only non-target-consistent production found in the child 
data is a very slight delay in OV order with pronominal objects and a certain over- 
use of this word order with DP objects. Dyakonova explains the latter finding in 
terms of a lack of Schaeffer’s (2000) Concept of Non-shared Knowledge: That is, at 
an early stage, children are able to make a grammatical distinction between differ- 
ent objects with respect to specificity/information structure in relation to syntactic 
position, but tend to interpret more objects as given information, not being aware 
that they may be new to their interlocutors. 


3.1.2 Variable V2 
Another word order that is typically argued to be due to a parameter is V2, the 
requirement found in most Germanic languages that the verb appear in second 
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position in the clause (the C position). Present-day English only has remnants of 
this word order in the form of subject-auxiliary inversion in main clause ques- 
tions, as illustrated in (10)-(12). This means that V2 is restricted both in terms of 
clause type and verb type (auxiliaries and be). 


(10) Which wine has Peter tried?/*This wine has Peter tried. 
(11) Let me show you [what I am reading]/*what am I reading. 
(12) What did Peter say?/*What said Peter? 


Also historical versions of English displayed variation with respect to V2, in that 
this word order in declaratives was dependent on a number of factors, e.g. the 
nature of the initial element and the information value of the subject, as illustrated 
in (13) and (14) from late Middle English (see e.g. Bech 2001, Eitler 2006, 
Westergaard 2009d, van Kemenade & Westergaard 2012). 


(13) This ordre be-gan Seyn Dominice be Zer of oure Lord a Mccxvj. 
(Sermon 147.7) 
“This order was founded by Saint Dominique in the 1216th year of our 
Lord: 


(14) Thre of his dukes he took... (Chronicle 199.30) 
‘He captured three of his dukes ..’ 


In much recent work, e.g. Westergaard (2008a, 2009c), it has been shown that 
there is considerable variation with respect to this word order also within and 
across present-day V2 languages. The variation may for example be dependent on 
clause type, such that V2 is required in most declaratives in Norwegian, while 
non-V2 is required in exclamatives and most embedded clauses. At the same time, 
some exclamatives in Danish and most embedded clauses in Icelandic require V2. 
This variation across V2 languages means that this must be learned from input. 

Like in the history of English, V2 variation in Norwegian may also be depen- 
dent on the initial element or patterns of information structure (Westergaard 2003, 
2009c): While most declaratives require V2, this word order is variable after the 
adverb kanskje ‘maybe’ as shown in (15), and while V2 is variable in questions 
with short question words (which are argued to be heads), it is required after disyl- 
labic wh-elements (which are phrases), see the example in (16). Furthermore, the 
variable word order after the short question words is dependent on IS factors, non- 
V2 being preferred when the subject is given information (typically a pronoun), 
see the authentic examples from a corpus in (17a, b).' 


1. The examples are taken from an adult in an acquisition corpus. For more information on 
this corpus, see Anderssen (2006) or Westergaard (2009a). 
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(15) Kanskjekommer kongen./Kanskje kongen kommer. V2/Non-V2 
maybe come.pREs king.DEF 
‘Maybe the king will come? 


(16) Korfor gar du?/*Korfor du gar? V2 
why go you 
“Why are you leaving?’ 
(17) a. kor er skoan hannes henne? (INV, file Ole.17) V2 
where are shoe.DEF/PL his LOC 
‘Where are his shoes?’ 


b. kor demer henne? Non-V2 
where they are Loc 
“Where are they?’ 


How do children fare with all this variation? If they were setting a parameter, we 
would again expect to see massive overgeneralization of one of the two word or- 
ders at an early stage, V2 or non-V2. However, English-speaking children are 
found to produce inversion in main clause questions as soon as relevant contexts 
appear in the data, and do not extend this word order to other verb types or clause 
types (e.g. embedded questions), as illustrated in (18)-(20) (Radford 1992, Roeper 
1999, 2007, Westergaard 2009b). 


(18) where is a box? (Adam, age 3;0.11) 
(19) Sue # what are you folding? (Eve, age 2;1) 
(20) so we can know [where the mailman is]. (Adam, age 332.21) 


Both V2 and non-V2 are also attested early in appropriate contexts in Norwegian: 
‘There is no overgeneralization across clause types, illustrated by the declarative in 
(21) with V2 and the embedded question in (22) with non-V2. Children also pay 
attention to the initial element, as in (23) and (24), and they produce word order 
that adheres to patterns of information structure, see (25) and (26). 


(21) sa  tegne z mamma. (Ina.02, age 1;10.4) v2 
then draw.pRESI mommie 
“Then I draw mommie’ 
(22) se her ka _ Ina gjor. (Ina.04, age1;11.22) Non-V2 
look here what Inado.PREs 
‘Look here what Ina is doing? 
(23) korfor far den ikkje mat? (Ole.16, age 238.5) v2 
why get.PRESit not food 
‘Why doesn’t it get any food?’ 
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(24) kanskje dem krangla. (Ole.14, age 2;6.21) Non-V2 
maybe they fight.pasT 
‘Maybe they were fighting’ 

(25) kor  e babyen? (Ina.06, age 231.0) v2 
where is baby.DEF 
“Where is the baby?’ 


(26) ka du _ skal finne? (Ina.05, age 2:0.5) Non-V2 
what you shall find 
‘What do you want to find?’ 


The only non-target-consistent production attested is an occasional delay in 
subject-auxiliary inversion or V2 word order in specific contexts, which may be 
attributed to economy of movement. For example, some English-speaking chil- 
dren have been found to invert later with modals (compared to the copula) and 
especially with the question word why, see (27). 


(27) Why he can’t hit? (Adam, age 3;4.01) 


A similar delay is occasionally found in Swedish and Norwegian child data, where 
V2 word order in declaratives seems to start out with be, then extend to sentences 
with new and/or focused subjects, and then generalize to all verbs and subject 
types. This development is so rapid in most children that it is not attestable, but 
some evidence is found in Waldmann (2008) for Swedish and Westergaard (2009a) 
for Norwegian, see the non-V2 word order in (28) with a verb other than be, and 
in (29) with an informationally given subject. 


(28) dar deninte ramlar ner pa golvet. (Tea, age 331.05) (Swedish) 
there it not fall.prEs down on floor.DEF 
“There it doesn’t fall down on the floor? 
Target: Dar ramlar den inte ner pa golvet. 


(29) der Ann har et. (Ann, age 231.28) (Norwegian) 
there Ann have.PRES one 
“There Ann has one’ 
Target: Der har Ann et. 


3.2, Word order variation not due to a parameter 


3.2.1 Subject placement 

It has often been recognized that there are two subject positions in languages with V2 
word order, one preceding and one following negation and certain adverbs in non- 
subject-initial clauses. The two subject positions have for example been identified in 
Old English, as illustrated in examples (30a, b), where the subjects appear either 
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preceding or following the discourse particles ba/bonne ‘then. Examples (31a, b) 
show that the two subject positions may also be found in embedded clauses. 


(30) a. Hu meg he donne det lof &done gilp fleon. 
how may he then praise and _—vainglory avoid 
‘How can he avoid praise and vainglory...?’ 


b. Hu gerades meg donne se biscep brucan dere hirdelican are. 
how properly may then the bishop enjoy the pastoral dignity 
‘How, then, can the bishop properly enjoy the pastoral dignity?’ 


(31) a. He nemihte swapeah ~~ efre libban, peah de he _ hine 
he not-could nevertheless ever live though that they him 
pa __utalysde, 
then released 
‘Nevertheless, he could not live forever, though they then released him? 


b. Gif him bonne God ryhtlice & — streeclice deman wile. 
if him then God justly and strictly judge will 
‘If God will then justly and strictly judge him’ 
(van Kemenade & Los 2006: 231) 


That is, the two word orders are either [XP-V-Neg/Adv-S] or [XP-V-S-Neg/Adv], 
and the distinction between them is typically related to IS, the one preceding ad- 
verbs and negation reserved for informationally given subjects and the one follow- 
ing these elements often filled by informationally new or focused subjects. This is 
standardly analyzed as subject movement from the low to the high position and 
thus commonly referred to as Subject Shift. The two subject positions are also 
found in present-day Norwegian (Holmberg 1993, Nilsen 1997, Svenonius 2002, 
Bentzen 2009, Westergaard 2011), and examples of these in main as well as embed- 
ded clauses are provided in (32) and (33). Furthermore, Westergaard (2011) shows 
that, in addition to the IS difference between the two positions, there are also fine 
syntactic distinctions between main and embedded clauses, the high position be- 
ing in the process of becoming a default subject position in embedded clauses. 


(32) Denne boka har ikke Peter lest./Denne boka har Peter ikke lest. 
this book.peFhas not Peter read/this book.peF has Peter not read 
“This book Peter hasn’t read? 


(33) Jegvissteat ikke Peter hadde lest boka/at Peter ikke hadde 
I knewthat not Peter had read book.pEF/that Peter not had read 
lest boka. 
book.DEF 


‘T knew that Peter hadn’t read the book’ 
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What do children do with this relatively complex word order variation? An inves- 
tigation of the three Norwegian children in the Troms@ corpus shows that both 
positions are in place early, illustrated in (34) and (35), with pronouns and full DPs 
appearing with similar proportions and IS patterns as in adult data (Westergaard 
2008b, Anderssen et al. 2010). The children even distinguish between main and 
embedded clauses, producing both DPs and pronouns in the high position in em- 
bedded clauses, while reserving it for pronouns in main clauses, just like in the 
adult data (cf. Westergaard 2011). 


(34) den treng ikke Ann. (Ann.09, age 232.19) 
that need.pres not Ann 
“That one Ann doesnt need: 


(35) gjor den ikke. (Ann.04, age 1;11.0) 
do.PRES it not 
‘(That) it doesn’t do. 


The only non-target-consistent production that is found in the child data is an oc- 
casional delay in the shifting of pronominal subjects to the high position at an early 
stage, illustrated for main clauses in (36) and embedded clauses in (37) (Westergaard 
2011). This is similar to the delay in object movement (OV word order) noted for 
the Russian child data in Section 3.1.1 above. Like in the Russian case, this is a rela- 
tively short-lived phenomenon, the right word order falling into place around age 
2;5-3;0 in main clauses and somewhat later in embedded clauses. According to 
Westergaard (2009a, 2011) this delay is due to economy of movement. 


(36) no kan ikke han sove mer. (Ann. 10, age 233.9) 
now can not he sleep more 
‘Now he can't sleep any more’ 


(37) ..ho si at ikkje det er min kjereste. (Ina.27, age 333.18) 
..she says that not it is my sweetheart 
‘She says that it isn’t my sweetheart. 


3.2.2 Possessive constructions 

Finally, I consider possessive constructions in Norwegian, in which the possessor 
may be pre- or postnominal, as in e.g. min bil ‘my car’ vs. bilen min ‘car.DEF my. 
The choice of the two word orders is dependent on a variety of factors, including 
stylistic level (the prenominal possessor construction being somewhat more for- 
mal). The main distinction between the two is based on IS, in that the prenominal 
possessor construction places contrastive focus on the possessor, while the post- 
nominal possessor construction is more neutral (Ledrup 2011, 2012). This is 
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illustrated in the following authentic examples from the acquisition corpus 
(Anderssen 2006, Anderssen & Westergaard 2010). 


(38) a. ja den derre jabba hennes, den gar i ett sett. 
yes that there mouth her it goes in one set 
‘Yes, that mouth of hers, it moves non-stop: 


b. ez horte hennes stemme over alle de andre nar e kom og 
I heard her voice aboveall the others when I came and 
henta ho. 
picked-up her 
‘I heard HER voice above all the others when I came to pick her up: 


It has been argued that the postnominal possessor construction is more complex 
than the prenominal one, in that it requires definiteness marking on the noun as 
well as syntactic movement of the noun across the possessor (Taraldsen 1990, 
Julien 2005, Anderssen 2006). Lodrup (2012) also argues that the prenominal pos- 
sessor construction represents the unmarked word order, the main reason for this 
being that there are a number of (very different) contexts where it is the only pos- 
sible option. On the other hand, there are virtually no contexts where the opposite 
situation holds, i.e. where only the postnominal possessive is possible. However, in 
informal spontaneous speech the postnominal possessor construction is consider- 
ably more frequent that the prenominal one, accounting for approximately 75% of 
the input. 

Anderssen & Westergaard (2010) investigate monolingual Norwegian chil- 
dren and find that both word orders are in place from early on, as shown in (39) 
and (40). 


(39) han er min mann. (Ole.03, age 110.22) 
he is my man 
“He is my man: 

(40) han verei skufla di. (Ole.05, age 230.10) 
he be inshovel.pEF your 
‘He (should) be in your shovel? 


The only non-target-consistent behavior found in the child data is that there is a 
certain overuse of the prenominal possessor construction at a very early stage, 
despite the fact that it is the least frequent one in the input. Anderssen & 
Westergaard (2012) study the acquisition of this construction in bilingual English- 
Norwegian children and find the same tendency, only that it seems to last slightly 
longer in the bilingual children. This is accounted for by structural complexity and 
economy of movement: The children start out with the least complex construction 
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and perform syntactic movement slightly less that what is required in the target 
language. This delay in the production of the postnominal possessor construction 
is relatively short-lived, and target-consistent use is in place around age 2;4-2;8 in 
the production of the monolingual children. 


4. A model of micro-cues 


4.1 Conservative learning 


After this overview of the acquisition of word order variation, the main question 
is the following: If children easily learn word order variation that is exceptional or 
not based on parameters, why do we need parameters to explain basic facts such 
as head-complement order or V2? These word order rules are very salient in the 
input, and also very frequently attested. If children can learn much more complex 
and much less frequent word order variation from the input, there is no reason 
why VO/OV or V2 should not also be learned from exposure to the primary lin- 
guistic data. In my view, the main question should be WHY children are so good 
at learning fine distinctions from the input. 

The findings reported in this paper support other recent studies both within 
constructivist and generative approaches to language acquisition which have ob- 
served that children are typically what is referred to as ‘conservative learners, 
mainly making errors of omission in their spontaneous production and hardly 
ever errors of commission.” That is, there are very few overgeneralizations attested 
in child language, at least in syntax. Within the constructivist camp, this is argued 
to be due to children’s focus on the input: Language is learned in an item-based 
and usage-based fashion, where children are picking up unanalyzed chunks and 
frames and making generalizations only after a large number of such frames have 
been learned. Both the order of acquisition and many errors that children make 
are related to input frequency, in the sense that the more frequent a construction 
is, the earlier it will be learned and the more likely it is that production will be 
target-consistent, see e.g. Tomasello (2003, 2006), Rowland and Pine (2000, 2003) 
or Ambridge et al. (2006). Within the generative camp, these findings have been 
combined with a parametric account, e.g. in Snyder (2007) or Yang (2002, 2010). 


2. A notable exception to this is the medial-wh phenomenon (e.g. What do you think what 
Cookie Monster eats?), first discussed by Thornton (1990). This has been attested in various ex- 
perimental studies and replicated for other languages than English. However, like Lohndal 
(2010), I do not believe that this phenomenon is due to a parameter (and parameter missetting 
in the child grammar). 
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According to the latter, children are endowed with a set of parameters as well as a 
powerful statistical learning mechanism, allowing them to evaluate evidence for 
and against competing grammars or parameter settings. 

In the next section I sketch a model of micro-cues to account for the variation 
discussed in this paper and children’s virtually error-free acquisition of such com- 
plex patterns. 


4.2 Cue-based acquisition and change 


The term cue is borrowed from David Lightfoot’s work on language acquisition 
and change, e.g. Lightfoot (1999, 2006). According to his theory, children scan the 
primary linguistic data for designated cues provided by UG. The cues are abstract 
pieces of structure in the I-language (internalized language), and they are formu- 
lated in the following way: 


(41) Cue for OV word order: ,)[DP V] 
(42) Cue for V2 word order: cpLXP -V...] 


It is crucial that cues do not correspond to input strings. Rather, the set of sen- 
tences in the child’s primary linguistic data is the triggering experience for cues, 
which are mental representations in the child’s I-language that result from parsing 
these input strings. In the words of Lightfoot (2006: 78), “a sentence EXPRESSES a 
cue if the cue is unambiguously required for the analysis of the sentence.” Thus, 
input plays an important role in this model, but language acquisition is not merely 
an input matching process. 

According to Lightfoot (1999: 93-4), there must also be a UG requirement 
that cues be obligatory. This is due to learnability and the lack of an effect of nega- 
tive evidence in the acquisition process: If cues are acquired only in some contexts, 
then children would produce word order variation (e.g. both VO and OV or V2 
and non-V2), and in cases where this is non-target-consistent (the assumption 
being that this is virtually always the case), children would need negative evidence 
(i.e. correction) in order to reach the target grammar. Since negative evidence is 
typically unavailable to children, they must somehow “know” that cues apply 
across the board. This means that Lightfoot’s cues in some sense correspond to 
major parameters. However, the theory differs from a parameter approach in that 
there is no question about initial settings or resettings, nor does competition 
(between different settings) play a prominent role. Instead, once a child encoun- 
ters evidence for a particular cue in the primary linguistic data, this cue will be 
activated in the child’s I-language. 
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4.3 Micro-cues 


Given recent findings in acquisition data, some of which are reported in this paper, 
I argue that children must be sensitive to much more fine-grained distinctions than 
what both major parameters and Lightfoot’s cues express. That is, children do not 
appear to be learning general rules first, and only later the fine details and excep- 
tions. Instead they seem to focus on the small details first, and then build up their 
grammars incrementally. I have therefore introduced the term micro-cue (see e.g. 
Westergaard 2008a, 2009a, b, c, Lightfoot & Westergaard 2007), in order to account 
for children’s early and target-consistent production of variation. That is, in addi- 
tion to acquiring a particular word order, e.g. V2, children must also learn in what 
linguistic contexts this word order applies. As has been shown in the previous sec- 
tion, children have no problem distinguishing between different clause types, 
different verb types, different initial elements, different information structure pat- 
terns, etc., from early on. In my view, this must therefore be expressed as part of the 
cue. Thus, instead of Lightfoot’s general cue for V2 in (42), there must be many 
smaller cues (micro-cues), identifying the relevant contexts for the verb movement 
rule. For example, there must be different cues for each clause type, as illustrated in 
(43) and (44) for Norwegian declaratives and exclamatives respectively:? 


KP oie Vil 
h 


SU...] 


(43) Micro-cue for V2 in declaratives (Norwegian): nade! 


(44) Micro-cue for non-V2 in exclamatives (Norwegian): 


ExcpLWA pyc 


But the micro-cues must in fact be much more fine-grained than this. In wh-questions, 
for example, they need to distinguish between different verb types in present-day 
English (auxiliaries/be vs. lexical verbs) and subject types (given vs. new, expressed 
here as [+Foc]) and types of initial elements (heads vs. phrases) in Norwegian. Thus, 
micro-cues for English main clause questions and Norwegian questions with short 
wh-elements (heads) should be formulated as in (45) and (46), expressing that verb 
movement only applies to auxiliaries and be in English and to any verb in Norwegian, 
provided that the subject conveys new information ([+FOc]). 


(45) Cue for V2 in wh-questions (English):* aap Wel: 
(46) Cue for V2 in wh-questions with monosyllabic wh-elements (Norwegian): 
intPl nel WA] Tope Topel V- XP 20] J 


3. lam assuming a Split-CP structure (Rizzi 1997), where the topmost projection (ForceP), is 
split into separate projections depending on clause type, ie. Decl(arative)P, Int(errogative)P, 
Excl(amative)P, etc. 


4. In this micro-cue, I is shorthand for auxiliaries and be, indicating that only elements that 
appear in the I position may move to C (i.e. the Int head). The fact that only auxiliaries and be 
may appear in I must be part of a different micro-cue (for V-to-I movement) that does not con- 
cern us here. 
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Micro-cues may similarly be formulated for structures that have not been consid- 
ered to be part of a parameter. Subject Shift may for example be formulated as in 
(47), which would be a piece of structure where an informationally given subject 
(i.e. [-FOc]) occurs in the specifier of the highest projection in the IP domain, the 
In(ner)Top(ic)P, which is above negation and adverbs (see Westergaard 2009a for 
an overview of the syntactic structure assumed for the IP domain). This micro-cue 
would then cause given subjects to move and thus appear in front of negation or 
adverbs, while informationally new or focused subjects would stay in the specifier 
of the TP, cf. examples (32)-(35) above. 


(47) Micro-cue for Subject Shift: [DP 


InTopP [-Foc] J 


A question that now arises is the following: Are micro-cues similar to cues, just 
smaller, i.e. more detailed? Generally, the micro-cues are made up of syntactic 
(sub-)categories and linguistic features. Occasionally, however, they need to make 
reference to particular lexical items, such as the adverb kanskje ‘maybe’ in 
Norwegian, since children immediately distinguish between this adverb and all 
other elements which may appear in initial position in declaratives with respect to 
the V2 rule, cf. examples (15) and (24) above. The extension from cues to micro- 
cues thus involves a crucial difference between the two concepts: Lightfoot 
(1999, 2006) assumes a rich UG, where the cues are provided by the innate lan- 
guage faculty. In the micro-cue approach, on the other hand, cues must be 
language-specific and learned from input. However, the micro-cues are made up 
of syntactic primitives provided by UG. 

This means that the model of micro-cues is a generative approach assuming a 
relatively restricted UG: Generally, the genetic endowment is assumed to consist 
of a pool of possible syntactic primitives (categories, features) and principles of 
structure building (basic syntactic operations, i.e. merge and move) as well as cer- 
tain universal constraints. This genetic endowment is powerful enough to enable 
children to parse the linguistic input that they are exposed to, and formulate the 
micro-cues in their I-language grammars. Thus, the micro-cues do not exist in 
UG, but are part of the grammar of a specific language — that is, they emerge in the 
acquisition process as a result of the interaction between UG and the input. 

In some sense the micro-cue approach is similar to a constructivist approach 
to language acquisition, in that it assumes a learning process starting with small 
constructions and gradually generalizing and building up a grammar. But the 
model of micro-cues is also crucially different from a constructivist approach, in 
that the micro-cues are not unanalyzed chunks or frames, but pieces of abstract 
syntactic structure resulting from parsing the input. 

The acquisition process is also affected by more general cognitive factors, 
e.g. memory limitations or the Concept of Non-shared Knowledge (Schaeffer 2000), 
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mentioned above in connection with the overuse of null subjects or OV word or- 
der in Russian (Gordishevsky & Avrutin 2004, Dyakonova 2004). Furthermore, 
economy is an important concept in the acquisition process: First, as often noted 
in the acquisition literature, omissions are much more common than errors of 
commission.” Second, children only seem to build as much structure as is triggered 
by the primary linguistic data and syntactic movement is avoided, unless there is 
clear evidence for it in the input. Thus, the slight delay that has been attested in the 
Norwegian child data discussed in Section 3 with respect to movement processes, 
ie. V2, Subject Shift or postnominal possessives, may all be accounted for by refer- 
ence to economy. Similarly, economy may account for the findings referred to in 
Section 2 that there is a certain overuse of null subjects at an early stage in various 
child languages, despite the generally target-consistent production corresponding 
to the grammar being acquired (Zdorenko 2005). In my view, this shows that chil- 
dren are sensitive to fine distinctions in syntax and information structure from 
early on, but that a certain effect of economy is present in early child language, 
causing a slight tendency for omissions and lack of syntactic movement. 
Considering the non-target-consistent production in early child language dis- 
cussed in Section 3 in more detail, we may now argue that children typically have 
smaller grammars than adults, i.e. they perform fewer movement operations than 
the target grammar requires. For example, the lack of subject-auxiliary inversion 
in some English-speaking children’s production is the result of the children not 
having acquired all the micro-cues relevant for the target language: That is, having 
learned inversion with be, the children do not automatically generalize to auxilia- 
ries; having learned inversion with the question word what, they do not immedi- 
ately generalize to linguistically distinct why. Similarly, Swedish and Norwegian 
children go through stages where V2 is restricted in declaratives to be, then ex- 
tended to sentences with informationally new or focused subjects, and then finally 
the rule is generalized to all declaratives. This means that the child grammar at a 
certain stage may make even finer distinctions than the target language, e.g. be- 
tween be and auxiliaries (in English child language) and between be and all other 
verbs (in Swedish and Norwegian child language). In most cases, these stages are 
very brief, as children are very sensitive to positive evidence in the input and 
quickly acquire the additional contexts for a movement operation. But the main 
point is that they are conservative learners: Instead of immediately generalizing to 
all contexts (e.g. all clause types or all verbs), they wait until they have positive 
evidence in the input. In this way, they avoid having to ‘unlearn’ a rule (for which 
they would have to have negative evidence in the form of correction), but they 


5. Children’s frequent omissions are also used by Faarlund (2008) to explain the diachronic 
process referred to as grammaticalization. 
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therefore also occasionally ‘undergenerate; i.e. their grammars are delayed in con- 
texts where generalization would in fact lead to a target-consistent result. 

All theories of language acquisition must have an account of how children gen- 
eralize from individual occurrences to more general rules. Within a parameter ap- 
proach to acquisition, the generalization is supposedly immediate and takes place 
across any instance of a category. In my view, this should lead to massive overgen- 
eralization in child data, which I have argued in this paper is not attested with re- 
spect to word order. Within a constructivist approach, generalizations should be 
late and based on a large number of examples in the input. And within the micro- 
cue approach, I claim that it is something in between: Children generally depend 
on the input, but they also make relatively rapid generalizations. The crucial point 
is that they only generalize within a small subcategory, not to all members of a 
major category (e.g. all verbs or all nouns); that is, they extend their grammars in 
small steps, adding new micro-cues. We should thus be able to find children mak- 
ing minor overgeneralizations in the acquisition process. As already stated, there 
are hardly any examples of syntactic overgeneralizations in acquisition data, but 
the ones that do exist suggest that this is a short-lived phenomenon and only hap- 
pens within a minor class or subcategory. In examples (48a, b), the English-speaking 
child (who has learned inversion with be) has extended the inversion rule to two 
other verbs, call and mean, which are syntactically and semantically similar to be 
(according to Roeper 2007, this pattern lasted for about a week). Extensions of the 
inversion rule to other verbs are virtually non-existent in acquisition studies. 


(48) a. what calls that? 
b. what means that? (from Roeper 2007) 


Summarizing, this means that the model of micro-cues is a generative approach to 
language acquisition that at the same time tries to make sense of some of the find- 
ings in the constructivist literature, cf. Section 2.2 above. That is, children’s conser- 
vative approach to language acquisition does not mean that learning is item-based 
(as argued in the constructivist camp), but instead ‘class-based’ or ‘feature-based’ 
The main question within the micro-cue approach to language acquisition is thus 
how far a child will generalize, ie. what constitutes a relevant class or subcategory 
(a next step) in the learning process. In my view, this is an empirical question that 
should be the focus of much further research. 


5. Summary/conclusion 


In this paper, I have considered various types of linguistic variation and shown 
that children are sensitive to fine distinctions in syntax and information structure 
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from early on. Thus, I have argued that there is no evidence in the investigated 
child data that the children are setting word order parameters, as there are hardly 
any overgeneralizations attested in the relevant contexts. Instead I have presented 
a model of micro-cues, which is related to the common observation that children 
are conservative learners, hardly ever making errors of commission. Micro-cues 
are small pieces of syntactic structure which emerge in children’s I-language gram- 
mar on exposure to the relevant input. This means that the micro-cues are 
language-specific, but they are made up of categories and features provided by UG 
and constrained by universal principles. According to the micro-cue model, chil- 
dren do not learn the major generalizations for their language first, but start out 
with small pieces of structure and build up their grammars incrementally. When 
they make syntactic generalizations, they do not immediately extend to a major 
category, but only within a class or subcategory, i.e. one micro-cue at a time. This 
means that children typically have smaller grammars than adults, and their non- 
target-consistent production (omissions, lack of movement) may typically be ac- 
counted for by principles of economy. 
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The evolution of language 
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University of Oslo 


How language evolved is one of the most debated subjects in science. Starting 
with Darwinian ideas of natural selection, scholars interested in the evolution 
of language have been polarized into two main camps, those who propose 

the faculty of language is a uniquely human trait, and those who claim that 
language arose from pre-existing systems of animal communication. This 
chapter discusses some important contributions to the question of language 
evolution, including studies of the physical and cognitive evolution of modern 
humans, the relationship between social complexity and brain size, the gestural 
theory of language, theory of mind, and the use of molecular genetics to identify 
putative language genes, concluding with the analysis of the FOXP2 gene in the 
Neanderthals. 


What is it that man can do, and of which we find no signs, no rudiments, in the whole 
brute world? I answer without hesitation: the one great barrier between the brute 
and man is Language. Man speaks, and no brute has ever uttered a word. 

Max Miiller, Lectures on the Science of Language (1861: 340). 


Introduction 


Language is widely recognized as the single, most important feature that distin- 
guishes humans from animals, and permits us to think and express thoughts. The 
origin of language has been called “the hardest problem in science” (Christiansen 
& Kirby 2003: 1), and speculations on the subject are tied to those on the origin of 
the human mind itself. Scholars disagree on the precise nature of language (and by 
extension, cognition), and whether it emerged suddenly in human evolution, or 
gradually from pre-existing systems of animal communication (Ulbaek 1998). 
Arguments on the uniqueness of humans reached fever pitch in the nineteenth 
century, particularly after the publication of the Origin of Species by Charles Darwin 


302 Erika Hagelberg 


(1859). The eminent Victorian naturalists Richard Owens and Thomas Henry 
Huxley almost came to fisticuffs over a small seahorse-shaped structure in the hu- 
man brain known as the hippocampus minor. Owen believed it was found in Homo 
sapiens alone, while Huxley demonstrated in painstaking anatomical investigations 
that humans and apes shared the trait, striking a blow against those who argued for 
a strict discontinuity between humans and other animals. In his essay A Matter of 
Degree, the evolutionary biologist Stephen J. Gould tells the story of how 


“for a month, in April, 1861, all England watched as her two greatest anatomists 
waged war over a little bump on the brain. Punch laughed and versified; and 
Charles Kingsley wrote at length of the “hippopotamus major” in his children’s 
classic of 1863, The Water Babies.’ (Gould 1978: 50). 


However humorous the spectacle of the squabbling great men, the debate, consid- 
ered crucial to the question of the uniqueness of humans, was followed with eager- 
ness by the scientific establishment and the public. But even Huxley, known as 
“Darwin's Bulldog” for his passionate support of Darwin's evolutionary theories, 
and indefatigable in his search for biological continuity between animals and hu- 
mans, insisted on the uniqueness of one human faculty, language. In the book 
Evidence as to Mans Place in Nature, he asserted “that the possession of articulate 
speech is the grand distinctive character of man” (Huxley 1863: 122). 

Darwin himself, in his Origin of Species, had proposed that all the characteris- 
tics of living organisms had developed from earlier, pre-existing forms, but ini- 
tially did not dare enter into a discussion of human origins or the evolution of 
human traits. More than a decade later, in The Descent of Man and Selection in 
Relation to Sex (1871), he suggested that speech had derived from existing systems 
of animal communication, through an intermediate stage of music. A musical 
proto-language, he wrote, was probably used in courtship and to mark territory, 
and full language emerged eventually when our ancestors attached meanings to 
certain vocalizations, perhaps through onomatopoeia. Darwin remarked that peo- 
ple had an innate capacity to speak, whereas individual languages themselves had 
to be taught, as was the case with the complex songs of some birds (Fitch 2009). 
Darwin recognized that humans and animals shared not only such vocal learning, 
but also traits such as memory, emotions and attention. In The Expression of the 
Emotions in Man and Animals, he wrote that “the earliest progenitors of man” had 
gradually learned to utter sounds to express strong emotions, similar to the ex- 
pressions made by living apes (Darwin 1872/2009: 92). 

While Darwin consciously avoided references to human evolution in Origin, 
the implication that language may have derived from animal communication was 
obvious and controversial. The assumption was attacked by the German philolo- 
gist and Orientalist Friedrich Max Miller, who held that language was the princi- 
pal trait that separated humans from animals. Miiller wrote 
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“Language is our Rubicon, and no brute will dare to cross it. This is our...answer to 
those who speak of development, who think they discover the rudiments at least 
of all human faculties in apes, and who would fain keep open the possibility that 
man is only a more favoured beast, the...conqueror in the primeval struggle for 
life. Language is something more palpable than a fold in the brain or an angle of 
the skull...no process of natural selection will ever distil significant words out of 
the notes of birds or the cries of beasts.” (Miiller 1861: 340). 


Despite his strong support for the uniqueness of language, Miiller recognized that 
many animals, including gorillas, whales and dogs, had efficient communication 
systems, and also the capacity of articulation, as in parrots, but emphasized that 
language was a mental faculty specific to human beings. 

In An Essay on the Origin of Language, Based on Modern Researches, published 
shortly after Darwin's Origin, the Oxford scholar Frederic Farrar discussed not 
only the writings of Miiller, but those of many other scholars with an active interest 
in the subject, including Jacob Grimm (Ueber den Ursprung der Sprache) and Ernest 
Renan (De l’Origine du Langage). Farrar was a bullish supporter of the uniqueness 
of human language, and asserted “No study is more rich in grand results than the 
study of language, and to no study can we look with greater certainty to elucidate 
the earliest history of mankind. (Farrar 1860: 2). Farrar doubted whether the ques- 
tion of the origin of language would ever be answered, as the earliest available 
evidence of all known ancient languages, including Sanskrit and Hebrew, was sep- 
arated by thousands of years from the first languages and the beginnings of hu- 
manity itself. However ephemeral language may appear, he believed spoken and 
written languages were governed by rigorous laws, whose careful study could help 
understand the whole of human history, in the same way as geologists could reveal 
the history of the earth from the physical evidence in rock formations. 

The seemingly endless speculations, combined with the dearth of empirical 
evidence and testable hypotheses, led in 1866 to the famed ban on discussions on 
the origin of language by the Linguistic Society of Paris. Despite continued inter- 
est, however, there was little substantial advance until the publication of Noam 
Chomsky’s Syntactic Structures in 1957, and ideas elaborated in many subsequent 
works (Chomsky 1957, 1975, 1988). Chomsky proposed that the faculty of lan- 
guage was innate, and governed by a “universal grammar” that transcends the 
grammars of individual languages. His ideas stimulated new debates, provoking 
an explosion in the number of articles on the origin of language, in subjects rang- 
ing from linguistics, psychology and computer science to anthropology, neurosci- 
ences and molecular genetics. The aim of this chapter is to provide a whistle-stop 
tour through some of the contributions to the puzzle of language origins, contri- 
butions which, however diverse, often return to the basic question of the unique 
nature of human language. 
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The role of natural selection 


Noam Chomsky doubted a Darwinian explanation of language, favouring instead 
discontinuity with animal communication. He proposed that language was a hu- 
man-specific system of representation, created by a sudden accident or mutation. 
Chomsky’s concept of universal grammar, the ability of every child to learn its na- 
tive language even before it can master seemingly easier tasks, applies to every 
language and feature of language, including speech, writing, and sign language. 
Chomsky argued that the ability of a child to acquire language is hard-wired in the 
human brain, in an innate language acquisition device unparalleled in the animal 
world, a human-specific “language organ”. Chomsky did not seek a mechanism for 
the origin of language, and is reported to have said that speculations on the origins 
of the language organ were as futile as speculations on the origin of the heart or 
any other organ of the body. His Cartesian definition of language as a specific 
property of the mind gave him freedom to develop his ideas on universal gram- 
mar, while leaving others to expound on the biological implications of his views. 

To an evolutionary biologist, Chomsky’s attitude was baffling, because the ori- 
gin of any organ, including a language organ, requires an explanation, and for 
most biologists the only logical explanation would be Darwinian natural selection 
(Maynard-Smith & Szathmary 1995: 290). An alternative, first proposed by the 
evolutionists Stephen J. Gould and Richard Lewontin, was that language was not 
the result of natural selection but a byproduct of other evolutionary forces, for 
example those resulting in greater brain size and complexity. In other words, lan- 
guage was an accidental feature such as the spandrels in Renaissance architecture 
(Gould & Lewontin 1979). Spandrels are the filled spaces at the intersection of 
arches and pillars, or between a dome and its supporting pillars (in this specific 
case they are called pendentives), as those of the San Marco basilica in Venice. 
These three-dimensional structures are complex and often beautifully ornament- 
ed, but not thought to be an intentional result of the design plan, but rather geo- 
metrical “leftovers”. Biologists call such a feature an exaptation, or a characteristic 
that may increase fitness but is not the direct result of natural selection. 

Other opponents of the role of natural selection in the evolution of language 
were Massimo Piattelli-Palmarini (1989) and Philip Lieberman (1989), but their 
views were challenged. Piattelli-Palmarini believed that grammar could not have 
resulted from selection because it has arbitrary elements and is not a predictable 
adaptation to communication, but he is held to misunderstand evolution by natu- 
ral selection. Lieberman argued that grammar does not exhibit the genetic vari- 
ance necessary for selection, but this is not necessarily true. People do exhibit a 
great range of linguistic skills, and just because we lack evidence of the heritability 
of grammatical competence it does not follow that variation does not exist (May- 
nard-Smith & Szathmary 1995: 291). 
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Figure 1. Spandrels are architectural byproducts formed at the intersection of a domed 
vault and the supporting pillar. Here, a spandrel of the St. Mark basilica in Venice is beau- 
tifully decorated with a figure of an evangelist and the personification of one of the bibli- 
cal rivers. Gould & Lewontin (1979) suggested language was a spandrel, a complex, but 
accidental, feature of evolution (reproduced from Gould & Lewontin 1979: 582, by per- 
mission of The Royal Society) 


In a landmark article, Pinker & Bloom (1990) attempted to reinstate Darwinian 
natural selection at the centre of the debate on the origin of language. 


“The only successful account of the origin of complex biological structure is the 
theory of natural selection, the view that the differential reproductive success as- 
sociated with heritable variation is the primary organizing force in the evolution 
of organisms.” (p. 708). 


They challenged Chomsky, as well as Gould & Lewontin, and rejected the idea that 
language was a spandrel, asserting instead that it was entirely explicable by Dar- 
winian natural selection, as it met two conditions: it conferred reproductive ad- 
vantage, and there had been sufficient time in the course of human evolution for 
language to emerge gradually. 

According to Pinker & Bloom, the evolution of grammatical competence 
would have conferred certain advantages to our ancestors, including the ability to 
plan and share knowledge to hunt and avoid predators more effectively. To allow 
ample time for natural selection to act, Pinker & Bloom dated the beginnings of 
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language to the time of the australopithecines, about two to four million years ago. 
Such a long evolutionary time-scale would have allowed “a series of steps leading 
from no language at all to language as we now find it, each step small enough to 
have been produced by a random mutation... and each intermediate grammar use- 
ful to its possessor.” Unfortunately, conceded the authors, “There are no conclusive 
data on any of these issues” (Pinker & Bloom 1990: 721), and it is true that, not- 
withstanding their slightly larger brains, there is no archaeological evidence of 
more complex social behaviour in australopithecines than in non-human apes 
alive today. Despite their strong arguments in favour of natural selection, Pinker & 
Bloom not only lacked data to support their hypothesis, but also failed to propose 
a mechanism for the origins of the “very first forms of language and the specific 
sequence leading to its current form” (p. 765). 


The paradox of continuity 


Another proponent of natural selection, the linguist Derek Bickerton, agreed that 
language must have evolved from some pre-existing system, while recognizing the 
huge gap between human language and other systems of animal communication. 
The problem, according to Bickerton (1990) was the absence of an obvious inter- 
mediate stage between even the most complex forms of non-human communica- 
tions and language itself, the so-called “Paradox of Continuity”. Like Chomsky, 
Bickerton regarded language as a system of mental representation rather than 
solely a means of communication. According to Bickerton, every animal, even an 
earthworm, had a sensory system to process information about its environment 
sufficient for survival, and a motor system to respond to that information, as well 
as a representational system to link the two. The representational system provided 
the raw material for natural selection, as animals with more complex systems were 
better equipped to respond to their environment and gained a reproductive ad- 
vantage. In an attempt to resolve the Paradox of Continuity, Bickerton proposed 
that language had evolved from systems of animal representation in two steps. 
Firstly, sometime between 1.5 million and 500,000 years ago, in Homo erectus 
(an archaic, large-brained human ancestor), a proto-language or precursor of true 
language evolved from an advanced system of representation like that found in 
primates. For Bickerton, a specialist in pidgins and the language of young chil- 
dren, human proto-language consisted of series of words without grammar, as 
spoken by infants or by adults speaking haltingly in a foreign language. Bickerton’s 
second step involved the sudden emergence of fully-fledged syntax in anatomi- 
cally modern humans, sometime in the last 200,000 years. 
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While seemingly espousing an evolutionary approach to language evolution, 
Bickerton’s second step, involving the sudden acquisition of grammar, appeared to 
contradict his continuity approach in favour a saltationist account, more in keep- 
ing with Chomsky’s programme, and unable to resolve the continuity paradox. In 
a later work, Bickerton went as far as to suggest that the emergence of syntax was 
a “catastrophic event” (Bickerton 1995: 69). Fortunately for Bickerton, in 1987 
“Chance lowered onto the stage the deus ex machina of the African Eve hypothesis, 
granting...a new license to saltationism.” (Studdert-Kennedy 1991: 260). This re- 
mark refers to the analysis of a type of DNA, mitochondrial DNA (mtDNA), that 
is inherited solely through the maternal line, and is still used widely as a research 
tool in population genetics. Evolutionary biologist Allan Wilson and colleagues 
Mark Stoneking and Rebecca Cann constructed a worldwide phylogenetic tree of 
mtDNA molecules of living women, and concluded that people of all continents 
descended from a common female ancestor who lived in Africa 140,000-280,000 
years ago. The mtDNA data appeared to support the view of a recent African ori- 
gin of a distinct species, anatomical modern humans, who eventually radiated out 
of Africa and replaced all archaic humans throughout the world, including 
Neanderthals in Europe (Cann et al. 1987). The “Out of Africa” or total replace- 
ment hypothesis contrasted with a competing model, the Multiregional hypothe- 
sis, that argued for genetic continuity between archaic humans and modern Homo 
sapiens throughout the world. 

This novel type of DNA analysis appeared to argue incontrovertibly for the 
recent emergence of a distinct species of clever, talking humans. Such a conclusion 
agreed with evidence from archaeology and anthropology for a sudden “human 
revolution” in biology and behaviour (Mellars & Stringer 1989). Anatomically 
modern humans, it was suggested, not only had different DNA, and differed phys- 
ically from other hominins (the term used to denote humans and their ancestors) 
in the shape of their skull and vocal tract, but were the first to make elaborate stone 
tools, cave art and symbolic objects in what became known as the Upper Palaeoli- 
thic Revolution (Mellars 1989). 


“By 40 000 year ago, populations in Europe and elsewhere were producing a wide 
range of novel artefacts, were burying their dead, painting on the walls of caves, 
and engaging in trade. It seems that something happened to make possible both 
the geographic spread of H. sapiens, and the burst of new technical and cultural 
practices. The obvious candidate is the origin of human language.” (Maynard- 
Smith & Szathmary 1995: 278). 


The Out of Africa hypothesis, and the idea of a cognitive revolution associated with 
anatomically modern humans, relegated the Neanderthals to the periphery of 
human evolution, a human species devoid of language and sophistication. The 
Neanderthals, it was argued, despite their large brains, “appear to be deficient with 
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Figure 2. The human fossil record, showing four possible groupings of human ancestors. 
The australopithecines were the earliest bipedal hominins, with brains not much larger 
than living primates. They were followed by two lineages: the robust, small-brained 
paranthropines, and the larger-brained Homo habilis, leading to Homo erectus, the 
Neanderthals and modern humans. The details of this scheme are still debated, as is the 
precise evolutionary relationship of Neanderthals to modern humans (reproduced from 
Maynard Smith & Szathmary 1995: 277, by permission of Oxford University Press) 


respect to their linguistic and cognitive ability. ... they probably communicated 
vocally at extremely slow rates and were unable to comprehend complex sentences.” 
(Lieberman 1989: 391). 

Ironically, by assuming a sudden cognitive leap in human evolution, some 
scholars were reinforcing the unique status of modern humans, including in the 
use of language, something more akin to Cartesianism than Darwinism. Such an 
attitude prevailed in many studies of human evolution until recently, when techni- 
cal advances in DNA sequencing have permitted the retrieval of considerable 


The evolution of language 309 


genetic information from the skeletal remains of the Neanderthals and a reap- 
praisal of the hypotheses of recent human evolution. 


Language and brain size in humans 


However heated the arguments on the role of natural selection in the origin of 
language, the fact remains that humans, in common with other primates, have 
large brains (specifically, large neocortices), and complex social interactions. The 
brains of our human ancestors had almost tripled in size well before the presumed 
origins of language, and there is no doubt that understanding the processes that 
drove human evolution and this increase in brain size is crucial for understanding 
the origins of language. 

Studies based on DNA data indicate that the lineages leading to humans and 
chimpanzees, our closest living relatives, split sometime over 4 million years ago 
(Hobolth et al. 2007). The first upright creatures arguably regarded as human 
ancestors were the australopithecines, small, gracile individuals like the famous 
Lucy, who was bipedal but had the brain capacity of an ape, indicating that an 
upright stance predated the enlargement of the brain (Johanson & Maitland 
1981). Evidence of the antiquity of bipedalism comes from the discovery by 
Mary Leakey of the Laetoli footprints preserved in hardened ash, dated to about 
3.5 million years ago. After this, there are two periods in human evolution 
marked by significant increases in brain size. The first was roughly 1.7 to 1.5 
million years ago, during the transition between australopithecines and the ear- 
ly forms of the genus Homo, and was probably driven by the shift from an 
arboreal lifestyle to subsistence in a more open, arid and hot environment, and 
an increase in the proportion of protein in the diet. Brain volume increased from 
400-700 cm? in australopithecines to about 1000-1200 cm? in Homo erectus, 
probably as a result of the behavioural demands of the open habitat, but this was 
not associated with specific developments in symbolic or cognitive ability, or 
language acquisition (Aiello 1996). The second increase in brain size happened 
between 800,000 and 200,000 years ago, and corresponds to the emergence of 
Homo sapiens, including the Neanderthals, forms of sapiens that are considered 
archaic by some palaeoanthropologists. The size of the modern human brain is 
between 1200 and 1500 cm%. Interestingly, Neanderthal brains are slightly larger 
than those of modern humans, ranging from about 1300 cm? in females to 1600 
cm? in males, with one male specimen known to have had a massive 1740 cm? 
brain (Stringer & Gamble 1993). 
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Figure 3. The relationship between cranial capacity (cubic centimetres) and age (millions 
of years) for the hominins. There were two periods of rapid increase in cranial capacity, 
and the latter remained comparatively constant throughout the Homo erectus period. 
Symbols as follows: plus sign = australopithecines; open box = early Homo, triangle = 
Homo erectus; filled box = Homo sapiens including Neanderthals (reproduced from Aiello 
1996: 90, by permission of The McDonald Institute for Archaeological Research) 


Homo erectus was a successful hominin, well adapted to his habitat, large-brained, 
omnivorous, and bipedal. The H. erectus phase of evolution extended from 
1.7 million to about 250,000 years ago, with little increase in brain size, probably 
no speech, and relatively little change in the stone tool culture, known as Acheulean 
(Gowlett 1992). But even if H. erectus lacked many of the anatomical features 
known to be associated with speech in modern humans, it must have developed 
one preadaptation to speech by virtue of its upright position, a valvular larynx. All 
mammals with prehensile forelimbs have a valvular larynx which, when closed, 
creates air pressure that stabilizes the chest to anchor the arm muscles in locomo- 
tion. The anthropologist Leslie Aiello considers the valvular larynx, and bipedal- 
ism itself, as preadaptations for speech. A closed larynx allows the airways to trap 
air and release it during phonation (Whitcombe 1996). In a bipedal animal, argues 
Aiello, “the selective pressure on the locomotor function of the valvular larynx 
would have been relaxed” and this “would have permitted the development of... 
more membranous vocal folds (cords) which would be associated with less harsh, 
more melodious sound” H. erectus also developed a shorter and wider mandible 
as a result of diet changes, which created a better shape for the movements of the 
tongue in speech (Aiello 1996: 94). 

Despite the relatively constant brain size and material culture of H. erectus for 
over one million years, the tools made by this hominin were comparatively 
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sophisticated, and this genus was able to expand out of Africa and colonize suc- 
cessfully many parts of Europe and Asia. The Acheulean stone tool culture did not 
disappear completely until about 150,000 years ago but, before that, the various 
tool manufacturing techniques and the large brain size associated with H. sapiens 
emerged. Tool styles remained relatively conservative well after the appearance of 
anatomically modern humans, and the burst of technological innovation associ- 
ated with more varied and sophisticated tools, and symbolic representation, did 
not happen until the Upper Palaeolithic, about 40,000 years ago. Language as we 
know it appeared sometime during this wide interval, but when and how is still 
open to speculation. 


Language as verbal grooming 


One explanation for the emergence of spoken language in Homo sapiens concerns 
its role in the social organization of large groups of individuals, the social brain 
hypothesis. The best predictor of brain development appears to be the size of the 
social group of an individual. Living primates maintain social cohesion by engag- 
ing actively in manual grooming, something that requires time and energy, and 
occupies forelimbs that could be used otherwise for gathering food. The evolu- 
tionary psychologist Robin Dunbar proposed that vocal grooming, or gossip, was 
a solution to free the hands and permit interactions between multiple partners in 
large social groups. To test this hypothesis, he compared the mean size of social 
groups to the neocortex volume of several taxa of living anthropoid primates, and 
revealed a positive correlation between the two, meaning that the larger the brains 
of the primates, the larger their social groups. Extrapolating this analysis to mod- 
ern humans yielded a group size of 150 individuals, arguably typical of one level of 
hierarchical organization in many societies, and similar to the number of people 
with whom one has a personal relationship (Dunbar 1993). 

Together with Leslie Aiello, Dunbar investigated this relationship in fossil 
hominids by estimating the neocortex volumes from the cranial volume of fossil 
skulls. They concluded that australopithecines and early Homo probably lived in 
smallish groups, of a size within the range of the living great apes, who maintain 
social cohesion by manual grooming. However, sometime after one million years 
ago, in the H. erectus lineage, group size started to grow exponentially (Aiello & 
Dunbar 1993). Dunbar estimated that a primate group would not be able to sus- 
tain more than 20% of its time in manual grooming, a figure based on observa- 
tions of living primates, and therefore suggested “a rubicon [sic] at around 30% of 
grooming time requirement beyond which group size could not increase unless 
language came into play.’ Such an estimate yielded a date of approximately 
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0.5 million years ago for the emergence of verbal grooming, with Homo erectus 
striding the Rubicon but Homo sapiens exceeding it, suggesting that the former 
almost certainly lacked the capacity of language (Dunbar 2003: 174). 

Dunbar invokes two independent pieces of anatomical evidence in favour of 
the emergence of language in the H. sapiens lineage. The first is the size of the hy- 
poglossal canal at the base of the skull containing the main nerve that controls the 
movements of the tongue, which is much larger in humans than other primates 
(Kay et al. 1998). The second is the particular shape and size of the thoracic region 
of the vertebral canal in humans, probably connected to the fine control of breath- 
ing needed for speech (MacLarnon & Hewitt 1999). Both of these traits, according 
to Dunbar, are like those of modern humans in archaic H. sapiens, but not in ear- 
lier fossil specimens, including H. erectus, supporting the view that speech devel- 
oped in the H. sapiens lineage (Dunbar 2003). 

Aiello and Dunbar (1993) proposed four stages in the evolution of language 
(reviewed in Dunbar 2003): 


i. manual primate grooming in small groups of australopithecines, 

ii a type of general vocal chorusing or chattering for social bonding in larger 
groups (as in some living primates), in Homo erectus, 

iii. directed, meaningful communication to support even larger groups, in archa- 
ic Homo sapiens, and 

iv. full symbolic language, with use of metaphor, probably associated with the 
Upper Palaeolithic Revolution. 


In Dunbar’s social brain hypothesis, music may have played a role in the evolution 
of language, something that would have pleased Darwin. Darwin's ideas on music 
as a stage in the evolution of language have been elaborated by Fitch (2009) in an 
essay to commemorate Darwin's bicentenary, entitled Musical Protolanguage: 
Darwin's Theory of Language Evolution Revisited, and also by Steven Mithen, for 
example in The Singing Neanderthals (Mithen 2005, 2009). These views give pri- 
macy to the role of speech in language evolution by suggesting that language 
emerged from communal, wordless chattering or singing, with certain sounds 
eventually developing meaning, and contrasts with the hypothesis that language 
evolved from gestures, described below. 


Words from gestures 
Rather than focus on speech, as in the social brain hypothesis, some scholars sug- 


gest that language, whether spoken or signed, evolved from pre-existing gestural 
systems in primates (Rizzolatti & Arbib 1998). The role of gesture in communication 


The evolution of language 313 


was recognized long ago, for example by early travellers who attempted to com- 
municate with local peoples by means of hand-signals. Our closest non-human 
relatives, the chimpanzees and bonobos (pygmy chimpanzees), have an effective 
system of communication using face and hand gestures, but have little control of 
vocalization, and attempts to teach them to speak have met with little success. On 
the other hand, they can be taught a relatively large number of specific signs 
(Gardner & Gardner 1969). The primatologist Savage-Rumbaugh and colleagues 
had remarkable results training a bonobo to communicate using a specially made 
keyboard with 256 symbols standing for words (1998). However, even a large ges- 
tural repertoire does not constitute true language, in the sense of a system with 
grammar and semantic content, something that appears beyond the capability of 
non-human primates. Conversely, human sign language contains all the gram- 
matical sophistication of spoken language, and children learn sign language as 
easily as spoken language if exposed to it during infancy. If true language is pos- 
sible with gestures alone, as in sign language, then “why the switch” to spoken 
language? asks Corballis (2009a: 28). 

One of the reasons is that speech is less energetically demanding than gestur- 
ing, as well as more efficient for communication and maintaining social cohesion 
in large groups, as discussed by Dunbar (2003). Corballis suggests that speech 
freed the hands for tool use, art, teaching and carrying, arguably precipitating the 
revolution in human behaviour described by archaeologists and palaeontologists 
(Mellars & Stringer 1989). It is worth remembering, however, that this “revolu- 
tion” happened late in human evolution. The use of the H. erectus type of Achulean 
hand tools persisted for several tens of thousands of years after the appearance of 
anatomically modern humans. Tool use began much earlier than language, and 
small-brained human precursors had already started to use manufactured stone 
tools, as opposed to just conveniently shaped sticks and stones, as long ago as 2.5 
million years, as revealed by archaeological finds in Ethiopia (Semaw et al. 1997). 

In his 1876 essay The Part Played by Labour in the Transition from Ape to Man, 
Friedrich Engels discussed the importance of tool use in human evolution. Engels 
recognized that the three most important features of humans were bipedalism, a 
large brain, and language, in that order. He held that the adoption of an upright 
position by our primate ancestors had freed the forelimbs for other activities, most 
importantly the use of tools. Engels suggested that tool use happened first, fol- 
lowed by increased intelligence and language, and in this respect he appears 
remarkably prescient. Manual praxis and language are now known to be intercon- 
nected in the brain, but the hands of our human ancestors were already occupied 
with tool making well before the supposed origin of language. Rather than freeing 
the hands for work, it is more likely that manual praxis drove brain size and ulti- 
mately language (Engels 1895/1934; Gould 1978). 
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Studies on patients with aphasia (language disorders) originally helped to local- 
ize the faculty of language in two discrete areas of the brain, Broca’s area and 
Wernicke’s area, named after the nineteenth century neurophysiologists who first 
identified them. It is now known that areas of the brain cortex implicated in language 
extend much further, to neural networks associated with both language and manual 
tasks. Different functions of language, including comprehension, grammar and pho- 
nation, are distributed across wide areas of the brain also involved in manipulation 
of objects such as pencils, scissors, and chopsticks, as well as with sequence predic- 
tion, visual perception, numbers, and music (Stout & Chaminade 2012). 

Engels, a keen student of natural science, would have been delighted to learn 
that, almost a century and a half after his essay was written, developments in 
neurophysiology and functional brain imaging would permit scientists to study 
directly the areas of the brain involved in learning, tool use, memory and lan- 
guage. Even more significant for his theory is the gradual realization that many 
aspects of grammar and spoken language are linked to the motor system. Re- 
search in this field is growing at a vertiginous rate, as illustrated by the recent 
publication of an issue of Philosophical Transactions of the Royal Society B wholly 
dedicated to the subject, entitled From Action to Language: Comparative Perspec- 
tives on Primate Tool Use, Gesture and the Evolution of Human Language (Steele 
et al. 2012). 

Many language functions mapped in the human brain correspond to the so- 
called mirror system in nonhuman primates. Mirror neurons are a class of visuo- 
motor neurons that are activated when a monkey observes another monkey or a 
human performing an action, like reaching for an object. Some mirror neurons 
are associated with hand actions and some with mouth actions (such as grasping 
food or lip smacking) and it is increasingly evident that they had an important 
role in the evolution of language. Mirror neurons are thought to be involved in 
action understanding, and possibly also imitation. Mirror neurons, and their 
equivalents in humans, provide the mechanism for a link between the sender of a 
message and its receiver, essential for communication (Rizzolatti & Craighero 
2004). 

Brain imaging studies in monkeys indicate that they already possess “the ca- 
pacity to mentally represent the action goal” (Rizzolatti & Craighero 2004: 183). 
The mirror neurons fire even when an object to be grasped by the hand is hidden, 
as long as the monkey is aware of the purpose of the action. Corballis (2010: 27) 
proposed that, sometime in hominin evolution, “the mirror system has assumed 
an additional role in language” In monkeys, the semantic content of the mirror 
neuron system is linked intrinsically to the gestures, whereas in speech the articu- 
latory actions and the meanings have split, marking the transition from gestural 
meaning to abstract sound meaning in human evolution. Brain imaging and 
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behavioural studies have revealed a clear link between the neural networks in- 
volved in gestures and those involved in sounds. Nevertheless, the evidence is 
complex, and the gestural theory of language still remains speculative and has 
been questioned by some authors, for example Burling, who points to the “nearly 
fatal flaw” of the theory, the lack of a satisfactory explanation for the switch from 
a visual language to an audible language (2005: 123). 

To explain how vocal communication may have become dominant over visible 
signs, Burling suggests a prolonged period of premusical and prelinguistic vocal- 
ization in human evolution, giving enough time for the vocal tract to come under 
good voluntary control before the emergence of anything resembling spoken lan- 
guage. Burling argues that a successful switch from gestures to vocalization would 
require significant cognitive development. 


“The understanding of both visible and audible signs should have benefited from 
better joint attention, better imitation, and a more skilful ability with motivated 
signs. As cognitive abilities improved, our ancestors should have become better at 
understanding both vocalizations and visible gestures.” (Burling 2005: 124). 


Theory of mind, recursion, and the faculty of language 


Language does not just have a role in communication, but also in thought and 
memory, as well as in the storage and transmission of knowledge in oral and writ- 
ten form (Corballis 2009b; Suddendorf et al. 2009). As seen previously, many of 
the debates on the uniqueness of language centre on the question whether animals 
can think. Many of the attributes of human culture are defined by “Theory of 
Mind’, meaning the ability to attribute mental states (including intentions, desires, 
beliefs and fears) to our fellow beings, as well as by levels of intentionality beyond 
those thought to exist in non-human animals (Briine et al. 2003). An animal such 
as a dog has at least one level of intentionality, in so far it possesses goals or desires, 
and is thought to have a mind, but not a theory of mind. Humans have a theory of 
mind, and can imagine what others think, defined as second order intentionality, 
and have also third and fourth order intentionality (Dennett 1983). Some human 
activities, including religion, require four levels of intentionality (shown in italics 
in the following example): “I have to believe that you suppose that there are super- 
natural beings who can be made to understand that you and I desire that things 
should happen in a particular way.’ (Dunbar 2003: 177). 

Simon Baron-Cohen (1995) proposed the existence of several steps or compo- 
nents in the ability to represent another person's beliefs. In the first level there is 
sensory input, which is shared by humans and other animals. In the second level 
there is a shared attention mechanism, or awareness of what another being can see. 
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This mechanism is thought to be present in chimpanzees, and is arguably recursive 
(he sees that I see that he sees, etc.) (Fitch 2010). The third mechanism involves 
theory of mind itself, believed by Baron-Cohen to exist in humans but not other 
animals. 

Chomsky (1988) suggested that abstract thought and language depended on 
the evolution of a computational capacity that was uniquely human. However, 
studies on animal cognition have demonstrated that non-human animals also 
have the ability to construct abstract representations. As described by Gallistel 
(2011: 253), “the symbol processing capacity underlying the thoughts expressed in 
human language appears to be evolutionarily ancient.” The ability to represent the 
outside world in a structured symbolic system is found throughout the animal 
kingdom, and what seems unique to humans is the specific mechanism for map- 
ping the private representations into a communicable system, language. 

In the social brain hypothesis, Dunbar (1992) proposed that social animals 
have complicated social interactions demanding complex competition and coop- 
eration strategies, even the capacity of mental attribution, meaning the ability to 
be aware of the point of view of an “other”. More than a decade earlier, the 
Cambridge psychologist Nicholas Humphrey argued that the intelligence of pri- 
mates, including humans, had not been shaped by the need to subsist and avoid 
predators, but by the demands of social interactions. Social life was more arduous 
than physical life, and involved complex alliances, disputes, and deceptions, all of 
which would have shaped cognition and intelligence in specific ways (Humphrey 
1976). While Humphrey applied his ideas of the social function of intellect to 
primate behaviour, many other animals, including birds, have complex social 
interactions. 

Clayton and co-workers have studied the behaviour of the western scrub-jay, 
a bird of the family of the crows, ravens, rooks and magpies, all social birds. The 
scrub-jays, in common with other corvids, congregate in large groups and exhibit 
social relationships considered typical of sophisticated social cognition in pri- 
mates. Like primates, they have a long period of juvenile development, and are 
highly visual and diurnal. Scrub-jays catch and hide food, and have adopted a 
number of strategies to protect food from pilfering, including hiding the catches, 
and then removing the catches to mislead potential pilferers who might have ob- 
served the first hiding place. This assumes a highly developed ability to model the 
behaviour of other individuals. Corvids can perform some of the tasks of primates, 
including problem solving, and extracting the general rule governing the solution 
and applying it to a new problem, as well as making tools to solve a problem such 
as reaching inaccessible food (Clayton et al. 2007). 

Corvids have the largest brain for their size of any birds, equivalent to the size 
of primate brains, but lack the typical six-layered cortex of humans and other 
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mammals. This means “that elements of social cognition appear to have evolved 
independently in at least two very disparate groups of animals, namely the apes 
and corvids” (2007: 520). This difference in brain organization raises the question 
whether the cognitive abilities in widely divergent taxa have similar or different 
biological mechanisms and evolutionary pathways. 

Hauser, Chomsky & Fitch (2002) suggested that language should be broken 
down into components, some shared by animals and humans, and only one com- 
ponent, recursion, specific to humans. Recursion may be defined as the ability to 
extend a language by embedding phrases within phrases in sentences ad infini- 
tum. In their influential article The Faculty of Language: What Is It, Who Has It, 
and How Did It Evolve? they broached the question of language evolution by 
making a distinction between the faculty of language in the broad sense (FLB) 
and the faculty of language in the narrow sense (FLN). The FLB contains the 
components of language that are shared at one level or another by humans and 
other animals, and includes a sensory-motor component, a conceptual-inten- 
tional component, and some other unspecified systems. The FLN consists of the 
core grammatical computations containing, at a bare minimum, the capacity of 
recursion. 


“Investigations into the evolution of the faculty of language are confronted with 
a three-dimensional research space that includes three comparative-evolutionary 
problems cross-cut by the core components of the faculty of language. Thus, for 
each problem, researchers can investigate details of the sensory-motor system, 
the conceptual-intentional system, FLN, and the interfaces among these systems.” 
(Hauser et al. 2002: 1571) 


Breaking complex behaviour into components, some shared by humans and ani- 
mals, and others specific only to humans, is a fruitful approach to the study of 
language. This requires interdisciplinary cooperation between diverse disciplines, 
including evolutionary biology, anthropology, neuroscience, psychology, computer 
science, and linguistics. Such an approach, developed further by Fitch in The Evo- 
lution of Language (2010), encourages scholars to consider language not as one 
unitary trait, but as a package of characteristics shared to a lesser or greater extent 
by different species. This provides the opportunity to formulate testable hypothe- 
ses about given aspects of the overall, larger question. Even the one seemingly 
human characteristic of recursion may have evolved for reasons other than com- 
munication, possibly related to number, navigation, or social relations. Pinker & 
Jackendoff (2005) have challenged the idea that recursion is the only uniquely hu- 
man component of language, and no doubt the debate will continue. However, the 
value of interdisciplinary approaches to the question of language evolution would 
appear incontrovertible. 
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The search for language genes 


The development of new tools of molecular analysis in recent decades has provided 
a novel perspective on the genetic basis of language and cognition. A common ap- 
proach by medical scientists is to identify genetic variants in families whose mem- 
bers share a particular defect in speech, language and reading. One such family 
came to the attention of researchers in 1990. Half of the members of this three- 
generation family, known as the KE family, had a verbal dyspraxia, or inability to 
articulate speech correctly, consistent with an autosomal dominant mutation. The 
family members were subject to a battery of language tests, and considerable overlap 
was discovered between the affected and non-affected members in many tests in- 
cluding pronunciation, grammar, and IQ. The presumed affected members had 
clearly impaired performance in word and non-word repetition, and oral praxis 
(control of non-speech movements) compared with the unaffected individuals. The 
core defect was therefore associated with the ability to coordinate and time the se- 
quences of movements needed for speech, with some grammatical and semantic 
impairments which may, or may not, be related to the core defect. The defect was 
initially mapped to the long arm of chromosome 7, and later analyses led to the 
identification of the implicated gene, known as FOXP2. The affected members of the 
KE family were shown to have a mutation in one copy of the gene, indicating that 
two intact, functional copies of the gene are needed for speech (Fisher et al. 1998; 
Lai et al. 2001; Vargha-Khadem et al. 2005). The mutation is extremely rare, and it is 
now known that only about one in fifty people diagnosed with developmental verbal 
dispraxia have such point mutations (MacDermot et al. 2005). However, the discov- 
ery of this gene provided finally “a molecular entry point into neurogenetic mecha- 
nisms facilitating human spoken language” (Fisher & Scharff 2009: 166). 

The FOXP2/FoxP2 gene (in humans and other animals respectively) belongs to 
a family of evolutionarily related genes known as forkhead genes, which code for 
proteins with an important regulatory function in the transcription of many other 
genes. Following the identification of the human form of the gene, there was great 
interest in the orthologous genes in other primates to discover what was special 
about FOXP2 in humans. Analysis of the FOXP2 protein in humans, chimpanzee, 
gorilla, orangutan, rhesus macaque and mouse revealed that the protein is ex- 
tremely highly conserved across different species, indicating it has a fundamental 
biological role across the evolutionary spectrum. There are just three amino-acid 
differences between the human and mouse. The chimpanzee, gorilla and rhesus 
macaque proteins are identical, and differ from the mouse at one position and from 
the human at two. The discovery that the human form of the FOXP2 protein had 
accumulated two differences since the split from the chimpanzee, ca. 4 million 
years ago, while the protein remained constant in the chimpanzee, gorilla and 
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rhesus macaque for a much longer evolutionary time, implied that the human 
FOXP2 may have had an advantage that drove it to fixation in human populations. 
The time for the fixation of the human form of FOXP2 in humans was estimated at 
100,000-200,000 years before present, curiously similar to the time of the expan- 
sion of anatomically modern humans, and consistent with the view that this expan- 
sion was correlated with the beginning of spoken language (Enard et al. 2002). 

Was here, finally, the single mutation, accident, or Big Bang, that marked the 
transition to modern humans, and led to the development of full, syntactical lan- 
guage in our ancestors? Were the saltationists to be vindicated after all? It appears 
not. Despite the flurry of excitement surrounding the characterization of the 
FOXP2 gene and its possible implication for the evolution of language, most pa- 
tients with specific language impairments have functional FOXP2 genes. Con- 
versely, FOXP2 and its animal orthologues have an important role switching on 
and off many other genes, so it is unlikely that FOXP2 is responsible directly for 
language, but rather that it affects genes implicated in other neural pathways 
(Bishop 2002). 

The amino-acid sequence of FoxP2 and the way the protein is expressed in the 
brain are highly conserved from crocodiles to humans. Studies on songbirds have 
shown that the zebra finch and human forms of the protein vary at only eight 
amino-acids, equivalent to more that 98% similarity between humans and birds, 
despite a separation time of more than 300 million years between their respective 
lineages (Haesler et al. 2004). FoxP2 has been shown to be involved in the song 
circuit of song birds, and appears to be “switched on” in parts of the brain in peri- 
ods of song learning (Scharff & Haesler 2005). 

Li and colleagues (2007) studied FoxP2 in echolocating and non-echolocating 
bats, to discover if the protein was associated with the complex vocal signals that 
echolocating bats emit for orientation and to capture prey. Despite the high degree 
of gene conservation between the bird and human lineages, the FoxP2 of echolo- 
cating bats was found to be extremely variable, indicating that the gene had an 
accelerated rate of evolution in these bats, and providing further support for the 
role of FoxP2 in sensorimotor coordination. 

The function of FOXP2 was studied in a mouse model carrying a humanized 
version of the gene (a genetically engineered mouse with the human form of the 
gene). The mice were generally healthy, but exhibited minor changes in ultrasonic 
vocalizations and exploratory behaviour, as well as altered dopamine concentra- 
tion in the brain (Enard et al. 2009). The amino-acid substitutions in the human 
FOXP2 appeared to increase synaptic plasticity, dendrite length and connectivity 
in the cortico-basal ganglia, in other words improved the information processing 
capacity of the brain. In humans, this suggests that the FOXP2 gene mediates phys- 
ical changes that influence important aspects of motor control, cognition and 
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language. While it is clear that FOXP2 cannot be regarded simply as “the language 
gene’, research on its function and evolutionary history is helping to shed light on 
the complex nature of language. It is heartening to consider how, in just a few de- 
cades, clinical research on a rare speech syndrome led to the identification of the 
first gene implicated in language ability, and eventually opened up the first known 
molecular window into important areas of brain function in animals, and ultimate- 
ly into the evolution of human language (Fisher & Scharff 2009; Enard 2011). 


Could Neanderthals speak? 


Studies of the function of FOXP2/FoxP2 in humans and its animal orthologues 
have stimulated a remarkably fruitful range of questions regarding neuronal de- 
velopment and activity, motor skill learning, vocalization, and human evolution. 
The evolutionary modelling studies of Enard et al. (2002) had shown that the two 
specific amino-acid substitutions which accumulated since the split of the lineages 
of humans and chimpanzees (our closest living relatives) were fixed in the human 
lineage in the last 200,000 years, at the time of the appearance of the ancestors of 
anatomically modern humans. The finding that FOXP2 had undergone positive 
selection in recent human evolution was consistent with the role of the gene in the 
emergence of language, or at least in the recruitment of certain parts of the brain 
for speech (Corballis 2004). 

Enard and colleagues had solid biological evidence in favour of a “human 
revolution’, and proposed that the changes in FOXP2 had aided the global spread 
of anatomically modern humans and their ability to supplant the Neanderthals. 
This fitted well with earlier views of a recent origin and expansion of anatomically 
modern humans in Africa, and their total replacement of all presumed archaic 
humans in the rest of the Old World, most notably the Neanderthals in Europe 
(Cann et al. 1987; Stringer & Andrews 1988). This extreme Out of Africa with total 
replacement hypothesis was challenged by some scholars (see for example 
Trinkhaus & Shipman 1994), but linguists such as Philip Lieberman argued that 
Neanderthals became extinct because they did not have the capacity of speech of 
modern humans (1989). 

The question of the relationship between Neanderthals and anatomically 
modern humans has vexed scholars since the discovery of the original fossil in the 
Neander Valley near Diisseldorf, Germany, in 1856. In their wonderful book The 
Neanderthals: Changing the Image of Mankind, anthropologists Erik Trinkhaus 
and Pat Shipman (1994) chart the adventures and misadventures of the Neanderthal, 
and perceptions of this creature in academia and the public sphere. Debates 
on whether the Neanderthal was ape or human, evolutionary dead-end or our 
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intelligent half-cousin, have mirrored the changing perceptions of our own hu- 
manity in the past 150 years. The question whether Neanderthals could speak has 
had a central position in these debates. The arguments have recruited data from 
fossil anatomy, archaeological evidence of tool use, and assumptions on the length 
of childhood in Neanderthals, among many others (reviewed extensively in 
Stringer & Gamble 1993), and continue to the present, with periodic reinterpreta- 
tions of old fossil findings and assessments of new ones (Lieberman et al. 2002; 
Balter 2002; Balter 2009). The FOXP2 gene has become a new “hippopotamus 
major” in these arguments, arguably the one structure that could settle the debate 
on the uniqueness of modern humans. 

It seemed that molecular biology tools might provide the answer. Starting in 
the mid-nineteen eighties, molecular biologists were able to extract DNA from 
extinct animals and human bones (Higuchi et al. 1984; Hagelberg et al. 1989). In 
1997, scientists in Germany analyzed a small segment of mitochondrial DNA from 
the Neanderthal type specimen, and showed that the DNA sequence fell outside 
the range of variation of living humans. Once again, this was hard biological evi- 
dence for the view that Neanderthals were an extinct form of archaic humans who 
had not contributed to the gene pool of living people (Krings et al. 1997). Further 
improvements in DNA sequencing technology in the wake of the human genome 
project made sequencing faster and cheaper, culminating in the recent publication 
of a draft of the Neanderthal genome (Green et al. 2010). Armed with novel tech- 
nical and bioinformatic tools, it finally became possible to investigate the FOXP2 
of Neanderthals. 

‘The results showed that, in contrast to the expectations of the Out of Africa 
view of human evolution, Neanderthals had precisely the same FOXP2 variant as 
living humans (Krause et al. 2007). This indicates that the human FOXP2 was al- 
ready present in the common ancestor of Neanderthals and modern humans or, 
alternatively, that gene flow had occurred between the two. These findings were 
welcomed by those like Erik Trinkhaus, who would wish to award Neanderthals 
the “degree of human behaviour that has been evident in their burials and archae- 
ological record for a hundred years”. More importantly, these results “also permit a 
perspective on the emergence of modern human behaviour (including language) 
that sees it as a complex interwoven matrix, with a variety of biological and behav- 
ioral components.” (Trinkhaus 2007: R918). Once again, an important trait thought 
to be unique to modern humans was found to be shared with our ancestors. 

Unfortunately, despite the novelty and importance of the findings, the ques- 
tion of the uniqueness of modern humans and their language remains unanswered. 
On the one hand, the functions of FOXP2 are multiple, and ancient in the evolu- 
tionary record, and the gene cannot be reduced to the gene for language. On the 
other hand, as was pointed out by Benitez-Burraco and colleagues (2008), the 
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Neanderthal FOXP2 data could be interpreted in a variety of ways, not all consis- 
tent with the Neanderthals being able to speak. Either way, the FOXP2 gene by it- 
self is unlikely to resolve the differences between the saltationist and the non- 
saltationist camps. Yes, two amino-acid changes accumulated between the 
chimpanzees and ourselves, and we appear to share these differences with the Ne- 
anderthals. But we are a long way from understanding the function of the gene and 
the significance of the two mutations. 

As we have seen, the debates on the origins of language and the uniqueness of 
language continue unabated, and have been given new impetus by the availability 
of novel scientific tools. However, this is not a matter of regret, as these debates 
continue to provoke new and fascinating research questions that will engage the 
attention of scholars in different disciplines for many decades to come. 
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Language as technology 


Some questions that evolutionary 
linguistics should address* 


Salikoko S. Mufwene 
University of Chicago 


Over the past two decades, studies of the phylogenetic emergence of language 
have typically focused on grammatical characteristics, especially those that 
distinguish modern languages from animal communication. The relevant 
literature has thus left the reader with the impression that language is either 
exclusively or primarily mental; in the latter case, its physical features, phonetic 
or manual, would be epiphenomena that may be overlooked. I argue that 
language is natural collective technology that evolved primarily to facilitate 
efficient communication in populations whose social structures were becoming 
increasingly more complex. It emerged through hominines’ exaptation of their 
own anatomy, thanks to the same mind that was enabling the complex cultures 
they were producing. Linguistic constraints of various kinds are emergent 
properties that are largely consequences of the modalities used, a position that 
does not expect signed languages cum legitimate linguistic systems to replicate 
the general architecture of spoken languages in all respects. The rest of the paper 
speculates on how the architecture of spoken languages evolved, gradually, with 
some features presupposing the prior emergence of others, whereas some others 
conjure up concurrent emergence. The facts suggest a complex non-unilinear 
evolutionary trajectory with many alternative options, consistent with emergent 
technologies in which considerations of optimality are absent. 


Introduction 


‘The title of this essay is a development from my ecological approach to the phylo- 
genetic emergence of language in mankind (Mufwene 2008, 2009, in press). 


* 


I am grateful to Diane Brentari, Evani Viotti, Terje Lohndal, and Cécile B. Vigouroux 


for useful comments on the last draft of this chapter. I am alone responsible for the remaining 
shortcomings. 
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According to it, language evolved gradually as an adaptive hominine response to a 
life style that was becoming more cooperative and gregarious. Our hominine an- 
cestors then lived in bands that were becoming larger and more complex; they 
needed more explicit communication to manage their relations and communities. 
The same mind that had led to this new social structure responded to the con- 
comitant ecological pressure for improved communication by domesticating the 
parts of the hominine anatomy that appeared the most suitable for the purpose. 
Taking into account communication among nonhuman primates and modern hu- 
mans’ body language today, we may assume that both the hands and the buccopha- 
ryngeal structures had then already been coopted for communication by gestures 
and vocalizations, respectively (Corballis 2002, 2010, 2011). The next step was to 
co-opt them for richer and more explicit exchanges of information in ways that 
were less context-dependent in their interpretation.! The new social ecological 
pressures that invited a hominine adaptive response must have included, among 
other things, the need to talk about past experience, making it inter-individually 
relevant to solving problems in the present and planning future group activities, as 
well as explaining consequences of their behaviors. 

The position developed here is consistent with current hypotheses about the 
emergence of modern language, some 50,000 years ago, coinciding with the emer- 
gence of rupestrian paintings and the late Homo sapiens’ dispersal out of East 
Africa (Lieberman 2010). I conjecture that “everything” in this phase of the pro- 
tracted evolution of language was being driven by a mind then endowed with a 
rich representational capacity and certainly capable of exapting extant means of 
communication to meet growing social pressures on group members to share their 
feelings, thoughts, and plans for the purposes of harmonizing their existence. Thus 
the emergent means of communication had to meet what Hockett (1959) charac- 
terizes as “displacement,” a trait apparently unique to human languages, which 
enables us to talk not only about the present but also about the past and the future, 
and thus to develop narratives that have world-creating power. Thus, we can not 
only talk about facts but also engage each other in constructing fictional and 
mythical universes brought to light not by paintings but with the contents of words. 
I submit that this is how symbolic communication (de Saussure 1916, Deacon 
1997) emerged, consisting of arbitrary signified-signifier or meaning-form symbi- 
otic pairs that can be combined in various ways to express diverse information 


1. Asis evident from the scholarship in pragmatics, especially Dan Sperber and his colleagues’ 
work on “mind reading” (e.g., Sperber & Wilson 2002), freedom from context in the interpreta- 
tion of linguistic utterances is relative, not absolute. We cannot deny the reality that some forms 
of communication are more context-dependent than others. It is in this spirit that my statement 
about the adaptive emergence of explicit, less context-dependent language must be interpreted. 
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about activities, emotions, thoughts, experiences, conditions of entities, etc., and 
to situate them in space and time. 

Below, I explain why, within this particular approach to the emergence of lan- 
guage, it appears justified to treat both speech and signing as technologies, albeit 
collective ones that evolved spontaneously (without anticipation of the ultimate 
systemic outcome) and piecemeal, more by exaptation and expansion of current 
materials (forms and structures) than by inventions ex nihilo.2 None of these as- 
sumptions are at variance with the traditional conception of languages in linguis- 
tics as systems consisting of units of various sizes and of rules and constraints 
governing how they can be used to express various kinds of meanings thanks to 
particular morphosyntactic combinations. I argue that in fact the approach makes 


2. Istarted thinking of “language as technology” in spring 2010, while reading Brian Arthur's 
(2009) very informative book On the nature of technology: What it is and how it evolved. In May 
of the same year, I articulated some of the ideas developed here in a keynote address I gave at the 
“Conference on Evolutionary Linguistics” at Nankay University, Tianjin, China. I was more en- 
couraged to pursue the idea after noticing that Namhee Lee et al. too had thought of language as 
technology, in their (2009) book Interactional instinct in which they focus on language as a com- 
plex adaptive system but do not develop the technology idea. As this essay was nearing comple- 
tion, I decided, out of curiosity, to google “language as technology” to determine whether I was 
not reinventing the wheel by any chance. My search revealed the following publications, which 
still do not make the orientation of my endeavor irrelevant or redundant: (1) “Language as a 
cognitive technology” (2002) by Marcelo Dascal; (2) “Language as a cognitive technology: Eng- 
lish-speakers match like Piraha when you don’t let them count” (2008) by Michael E. Frank, 
Evelina Federenko, and Edward Gibson; and (3) “Ceaseless, unpredictable creativity: Language 
as technology” (2009) by Jan Koster. Of the three, the last is the closest in spirit to the present 
essay. However, although we both underscore the significance of phonetics and manual signs as 
materials that languages are made of, as well as the culture-specific ways in which their architec- 
tures vary, we focus on different aspects of the technology. Koster focuses more on the centrality 
of the word and how grammar evolved because of how it is used to convey information, where- 
as I focus on how the overall architecture of the technology evolved in mankind and what par- 
ticular trajectories the evolution may have followed. This explains also why I will hardly return 
to his insightful article below, though I share most of his views. The other two essays have to do 
with how language is coopted to carry out cognitive processes and are therefore peripheral to 
Koster’s and my concerns. 

On the other hand, just as I was completing the revisions of this essay, an accident of a 
completely independent reading led me to Douglas McArthur’s (1987) short article “Le langage 
considéré comme une technologie,” in which he sketches a position very similar to mine, but did 
not elaborate. He states in the English abstract of the paper that language can be considered as 
“a collection of tools and methods, a ‘technology’. Like other technologies[,] it is the product of 
human creativity, is elaborated over a period of time, and continues to change according to 
needs” (p. 157). He says on pages 159-160 that it can be characterized as “semiotic technology” 
or “information technology,’ albeit one with a “biological substratum.” I obviously have more 
space to elaborate the position, including his argument for a polygenetic emergence of lan- 
guages, in the plural. 
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it easier to also think of languages as cultural artifacts whose protracted emer- 
gence can be accounted for on the model of cultural evolution, though the role of 
biological predisposition for language (evident in hominine mental capacity and 
particular anatomy) can certainly not be discounted. The approach suggests that 
typological variation is a natural consequence of languages as cultural produc- 
tions, directing attention to parallels between variation in some aspects of linguis- 
tic structures with that in the organization of other cultural domains such as 
kinship, cooking, music, belief “systems,” clothing/dress code, and architecture. In 
fact paleontological evidence suggests that we speak of the evolution of languages, 
in the plural, rather than just of language, in the non-individuated style of refer- 
ence (see Section 3.2, below). 


2. Do languages really qualify as technology? 


The position defended in this essay depends centrally on what is meant by “tech- 
nology.’ As explained by Arthur (2009), technology need not be material, it need 
not have a complex structure (though languages do), it need not be monolithic, it 
need not have been planned, it need not be a conscious invention, it may have 
acquired this status in an ad-hoc fashion, it need not work perfectly and can be 
improved several times on different occasions, and it can be adapted to new uses. 
“Technology (...) is a means to fulfill a purpose: a device, or method, or process” 
(p. 29). It can be knowledge (when used to explain or handle something else), 
practice (when extended to the solution of a problem), or science (as in science and 
technology, when scientific knowledge is applied to develop practical things, in 
order to make our lives easier or to help us solve problems, p. 27). 

My position depends largely on what LANGUAGE is understood to be. In fact, 
from a phylogenetic perspective, I have in mind Lancuacgs, indeed in the plural, 
consisting of phonologies, vocabularies, and morphosyntaxes that vary from one 
population/culture to another. I will often downplay the non-individuated ab- 
straction LANGUAGE, conceived of on the basis of common characteristics of lan- 
guages, especially the most abstract ones that have to do with Universal Grammar 
(for those who posit it), without any mention of materiality of the sounds or man- 
ual gestures engaged in the systems (see also Koster 2009; Evans & Levinson 2009). 
From an evolutionary perspective discussing languages without any reference to 
their physical components would be like talking about computers without any ref- 
erence to their hardware or trying to make sense of how the mind works without 
any reference to the (activities of the) neurological circuitry in the brain. It is tan- 
tamount to ignoring the reason why all the principles that have retained the atten- 
tion of linguists in such scholarship emerged, just like ignoring the constraints that 
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the hardware imposes on the software of a computer (by way of facilitating or im- 
peding some processes) and the brain on the capacity of the mind. 

There would have been no languages for linguists to discuss today if homi- 
nines had not coopted vocalizations and gestures to produce speech and signs, 
respectively, which have made symbolic communication possible. Languages are 
systems whose elemental units are physical, to which various rules and constraints 
apply (on various levels of their architecture) and with which semantic interpreta- 
tions are associated. The elemental units are sounds in the case of speech and 
handshape, orientation, and movement, among other components, in the case of 
sign languages.*? Languages have non-monolithic architectures not only because 
the scopes of application of their rules overlap in various ways (Mufwene 1992) 
but also because they consist of material and abstract (computational) parts, in- 
deed like computers, consistent with Arthur’s (2009) conception of technology 
(e.g., p. 31). 

As a defender of polygenesis, I submit that languages are technologies that dif- 
ferent populations, at different evolutionary stages of the hominine phylogeny, 
developed piecemeal in response to their social pressures for richer and more ex- 
plicit communication. They are productive systems (a term hardly applicable to 
animal means of communication) which possess principles and constraints gov- 
erning how to form larger units from smaller ones and how to form meaningful 
utterances. Some of the principles are consequences of the particular modalities 
involved, especially compositionality, which is a consequence of the discrete na- 
ture of their elemental units and the linearity they impose on utterances, although 
linearity is less strictly implemented in sign language than in speech. Culture- 
specificity is evident in variation within the particular size and composition of 
phonetic sounds used in a language, though a core of them are attested in all 


3. Iwill avoid the mistake of trying to break down sign language in a way that matches speech 
on every level, because, as explained below, each modality in itself is an inherent ecological fac- 
tor that constrains the kind of system that emerges. It determines what the most elemental units 
can be, how they can combine into more complex units and what combinations are not possible. 
For instance, speech cannot be iconic in the same way that sign language can (as the former 
depends on hearing and the latter on vision to be perceived and interpreted), nor need the latter 
be as strictly linear as speech is. On the other hand, the “strict linearity” of speech depends on 
the articulators involved, since co-articulated sounds (e.g., labiovelar stops) are possible. The 
co-occurrence of segmental and supra-segmental sequences does not dispute the essence of my 
claim, because they are produced in parallel and do not engage the same articulators. The supra- 
segmental elements of an utterance are also structured linearly. For instance, a vowel cannot 
have a high and a low tone at the same time, though it can have both sequenced, as a falling or 
rising tone. Much of the discussion below will focus on segmental sequences. The reader should 
remember that sign languages have their prosodic features too, which are equally ignored in the 
present discussion. 
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languages (such as the vowels /i, u, a/), in the phonotactic and other phonological 
principles that have been adopted, in word-formation strategies, and in syntactic 
constituent order and other conventions. Thus, spoken languages vary typologi- 
cally not because the essential architectural principles of the linguistic technology 
(produced by phylogenetically the same human mind co-opting the same anato- 
my) are different, but because there was latitude for variation in the choice of basic 
units (perhaps less in sign language, consistent with my explanation below) and in 
the specifics of the emergent principles. Once the basic units and principles were 
adopted, it did not matter how a particular population developed the rest of the 
emergent system. This kind of variation is observable in other cultural domains, 
such as cooking, music, kinship, and courtship, except where more complex mate- 
rial technology is involved. In modern technological terms, it is like the variation 
observable in the word-processing industry, as one particular software enables the 
user to do certain things in ways somewhat different from another software, while 
they produce comparable results.* 

Architectural differences between speech and signing deserve some attention, 
because the modalities used impose their own specific constraints on the architec- 
ture of the emergent systems. While the HANDSHAPE, PALM ORIENTATION, and 
MOVEMENT and PosITION of the relevant articulators make it possible to tell one 
word-level sign from another, they are comparable as much to sounds (in that, be- 
ing elemental, the latter help us tell one phonetic word from another) as to mor- 
phemes, because the signing parameters often contribute to distinguishing the 
meaning of one signed word from another in the way spoken morphemes do, 
when the composition is transparent. For instance, the motion of an entity can be 
signed differently with the hand(s) assuming the corresponding shape (open or 
closed), direction (forward or backward), and orientation (up, down, horizontal, 
etc.), and according to whether or not this articulator is kept steady (rolling, 
bouncing, weaving, or a steady plane, etc.). To be sure, such iconicity and morpho- 
logical transparency are not typical of the whole vocabulary of a sign language. In 
any case, speech is incapable of such iconicity; these are distinctions that are 


4. Evans & Levinson (2009: 429) state that the putative universals of language are “few and 
unprofound” and that “languages vary radically in sound, meaning, and syntactic organization.” 
They may thus disagree with my still relatively conservative conclusion. Semantic differences 
across languages remain what they are, in that words and phrases do not convey identical mean- 
ings crosslinguistically, although most of them can translate each other in ways multilingual 
speakers can live comfortably with. I am focusing here on the simple fact that these differences 
do not prevent speakers of particular languages from communicating about themselves and 
their ecologies as adequately as those of others do. In fact, they (can) use their respective, cul- 
ture-specific strategies to communicate about other cultures without modifying their respective 
ways of packaging information, which justifies the saying “lost in translation.” 
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typically expressed linearly rather than synchronously, by agglutinating mor- 
phemes to a base in some languages or periphrastically in others, with adverbials 
or in serial verb constructions.° 

Assuming that sign languages must have an architecture similar in all respects 
to that of spoken languages, with phonemes as the elemental units (which com- 
bine into morphemes and the latter in turn into words, which combine into 
sentences), is about as good as expecting the motion dynamics of a lizard to be 
identical to the slithering of a snake or perhaps like expecting a runner and a 
swimmer to race toward their targets using the same techniques. Because it is 
based on a different modality than speech, sign language need not be expected to 
have the same kind of architecture as speech in order to function as adequately as 
it does as a means of communication.® Having different kinds of minimal units 
and on a level that seems to conflate phonology and morphology in speech does 
not make signing less linguistic. Actually, the difference should help us articulate 
more accurately what it takes for a communicative system to be linguistic. This 
may have to do more with symbolism than with how the symbols are formed, just 
like a wall in a house remains functionally a wall regardless of whether it is built 
out of stones, concrete, or wood and the finish consists of plaster or drywall. 

There are of course things that one can do with one particular piece of alterna- 
tive technology but not with the other; and it appears indeed that, evolutionarily, 
speech has not been more advantageous in all respects. There are situations where 
it is safer signing than speaking, such as when communicating under water or 


5. As we are reminded of by McCleary & Viotti (2010: 198), Charles Hockett was well aware 
of the fact that the architecture of sign language need not be conceived of on the exact model of 
speech. Signing is not arbitrary to the same extent that usage of phonetic materials forces speech 
to be, and in fact it is not strictly linear. “[T]he dimensionality of signing is that of life itself, and 
it would be stupid not to resort to picturing, pantomiming, or pointing whenever convenient” 
(Hockett: 1978: 274-275). From the point of view of processing the coded information, visual 
perception need not be constrained by the same factors as aural perception. These are ecological 
factors that bore on the evolutionary emergence of linguistic technologies. 


6. This is not to deny that the two kinds of linguistic systems do also share some features, such 
as in being modular, using symbolic words/signs (speech perhaps more extensively than sign- 
ing, which allows more iconicity), and, among other things, in resorting to combinatorics in 
order to express ideas more complex than those expressed in individual words/signs. The issue 
that I raise here is about whether “duality of patterning,” or “double articulation” (in Martinet’s 
1960 terminology), applies (exactly) the same way in both kinds of systems. The reader should 
note that I am not at all questioning the existence of “duality of patterning” in sign language. 
Even if the latter did not really have a phonological level (a position apparently not so accurate 
either), the morphological and the lexical levels alone would still justify positing the “duality” 
This term, like “double” in “double articulation,’ is a misnomer for a stratification of more than 
two levels in spoken languages: at least sounds, morphemes, and words. 
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hiding from a predator (Mufwene, in press), though I focus below on architectural 
differences that are consequences of modality differences, thus on the fact that dif- 
ferent materials have been used in the communication technology. For instance, 
because hands can imitate shapes and motion better than sounds, they can pro- 
duce more iconic signs than our speech organs (Hockett 1978, Goldin-Meadow 
2003, McCleary & Viotti 2010), although even iconic signs reflect some arbitrari- 
ness in the form-meaning pairing, as is evident from onomatopoeias in spoken 
language.’ Speech would perhaps contain more acoustic icons if it were not pho- 
netic, ie., consisting of sounds that mean nothing alone and favor arbitrariness 
over iconicity in the ways they combine into words. A particular limitation with 
onomatopoeias is that they often consist of sound combinations that are marked 
compared to those used in the vast majority of words in specific spoken languages, 
for instance bilabial and dental clicks in several non-click languages. This is not 
the case with iconic signs in sign language, precisely because signed words are not 
composed by the same principles as spoken words. The most basic units used in 
the signing technology are not always the counterparts of phonetic sounds. The 
same parameters of HANDSHAPE and MOVEMENT, etc. that appear to have a mor- 
phemic status in iconic signs are used with no particular meanings in the more 
arbitrary signs. It appears that the morphological and the phonological levels are 
conflated into one heterogeneous module of discrete elemental units that do not 
have a uniform structural status in all words (semantic in some but strictly formal 
in others). “Duality of patterning” applies in that the elemental units help tell one 
word from another, regardless of whether they are interpretable as morphemes or 
counterparts of phonemes in spoken languages, and then each word helps tell one 
utterance from another in a discourse.® 


7. According to Aronoff et al. (2005), iconic signs are universal across sign languages and 
presumably less arbitrary. I think this difference in the faithfulness of iconicity and even in the 
extents to which it is exploited in the lexicon, not necessarily in the syntax, reflects the greater 
capacity of signing to encode more transparently the physical properties of the universe of 
reference than speech. It is consistent with the position being developed in this paper, viz., that 
some of the architectural properties of languages are consequences of the particular modality 
that is used. 


8. This difference should not be confused with the fact that spoken languages also have mor- 
phemes that consist of one sound only, as at the beginning of a-like and the end of fuzz-y in 
English, or the fact that such words also have a certain amount of transparency that makes them 
somewhat comparable to iconic words in sign languages. The fact is that TRANSPARENCY and 
ICONICITY do not mean the same thing and the proportion of iconic words in sign languages is 
much higher than in spoken languages even if the iconic signs reflect some cultural arbitrariness 
and constitute only a subset of the lexicon. As noted by Hockett (1978: 274-275), “it would be 
stupid not to resort to picturing, pantomiming, or pointing whenever convenient.” This option 
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It is apparently because of the same architectural differences discussed above 
that signed language is less bound by linearity than speech. The parameters that 
are combined into iconic signs are capable of expressing meaning, like MOTION UP 
Or MOTION DOWN through the inclination and position of the hand, just as the 
kind and direction of the motion of a referent can be expressed concurrently by 
the way the HANDSHAPE (cum classifier) representing the referent is moved within 
the space used by the signer. Even morphosyntactic incorporation in speech can- 
not do this, because speech is not produced in a tridimensional space. Besides, as 
noted above, speech articulators are incapable of producing two sounds concur- 
rently, let alone two morphemes or words, whereas each of the two hands may be 
engaged in signing two different ideas. This is a significant compensation for sign- 
ing, because it would otherwise be too slow if it operated on the phonetic, strictly 
linear model. The size of the articulators and the larger space in which signs are 
produced (compared to speech) would alone, independent of how syntax is han- 
dled, make the technology too energy-consuming and less fluid. (See also Swisher 
1988: 350 for a similar position.) So, signing is definitely a different kind of tech- 
nology. As I show in Section 3.1, it is not patterned on speech, though there are 
similarities in the ways they relate form to the universe of representation or refer- 
ence (see Section 3.2) and in the general principle of combining smaller signs into 
larger ones (phrases and sentences), in order to convey more complex meanings.” 


makes communication in the manual modality particularly informative and efficient in avoid- 
ing the longer time that the linear sequencing of signs would entail. 


9. As explained by Brentari (2011), according to scholars of sign language, “the root node at 
the top of the structure’ where the parameters HANDSHAPE, PLACE OF ARTICULATION, 
MOVEMENT, ORIENTATION, and NONMANUAL BEHAVIOR combine “is an entire lexeme, a stem, 
not a consonant- or vowel-like unit” (691-692). The lexeme is often polymorphemic, though it 
is typically monosyllabic, because signed language is not strictly linear. The pervasive iconicity 
enabled at the putative phonological module by parameters such as HANDSHAPE and MOVEMENT 
conjures up morphology rather than phonology in speech, where phonemes help tell mor- 
phemes or words apart but do not typically convey specific meanings alone. 

However, faithful to her research paradigm, Brentari concludes that the phonologies of the 
signed and spoken languages differ physically but “[t]he more focus there is on the mind/brain 
the more sign language and spoken language phonologies will look the same” (719-720). 
Certainly, like that of spoken language, the architecture of signed language is modular and mul- 
tistratal (consistent with the “duality of patterning”), especially from the point of view of 
production, involving combinations of smaller units into larger and larger ones (from the pa- 
rameters to lexemes and then onto phrases and sentences). However, this architecture need not 
involve an organization that matches that of speech in all respects. There are alternatives that 
only one modality can favor but not the other. The fact that signed language (not including fin- 
gerspelling, an imitation of English spelling) has a large proportion of elemental units corre- 
sponding to morphemes rather than to sounds at the putative phonological level makes all the 
difference, especially since those corresponding to sounds apparently fall in the minority. Note 
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After all, they are emergent productions by the same mind associated with the 
same late Homo sapiens. 

Those of us working on the evolution of language have often wondered why in 
all human populations, the majority have settled on speech rather than signing; in 
fact the latter has typically served as a fallback option for those who cannot pro- 
duce speech, either because they have a hearing impairment (which deprives them 
of the input) or because they have lost the ability to speak after a particular injury 
or trauma. The question is especially significant, because, based on the communi- 
cative practices of nonhuman primates, it has been hypothesized that intentional 
communication among hominines probably started with gestures, the distant an- 
cestor of signing (Tomasello & Call 2007, Corballis 2011, Pollick & de Vaal 2007), 
as explained in the next section. 

‘The reasons why speech has prevailed demographically over signing are nu- 
merous and will not be discussed exhaustively here. They include the following: 
(1) Speech has freed the hands to do other things, such as carrying objects or using 
tools, concurrently with a communicative activity. (2) Speech does not require the 
listener to be in the visual fields of the speaker, whereas the addressee must be, in 
order to see the signs produced by the signer. This also provides the advantage of 
being able to communicate in the dark or across visual barriers that do not consti- 
tute sound barriers. (3) The broadcast capacity of speech has a wider range than 
that of signing (MacNeilage 2008). The speaker does not have to face the address- 
ee, whereas the signer must. According to Dunbar (1996), this broadcast capacity 


that the transition variation observed with the sign language parameters could be matched as 
much with allophonic as with allomorphic variation in speech; it constitutes no compelling 
evidence for positing an autonomous speech-like phonological module in signed language, at 
least from the technological perspective developed in this paper. Even the fact that the phono- 
logical level in sign language is predicated on analyzing HANDSHAPE, LOCATION, MOVEMENT, 
etc. by invoking distinctive features rather than on positing phoneme-like segments 
(Diane Brentari, p.c., 2/27/2012) suggests that distinctive features need not be associated exclu- 
sively with phonetics/phonology. Their function is to enable the distinction of minimal segmen- 
tal units from each other. Although the elemental units happen to be sounds in spoken language, 
the monosyllabic units that the distinctive features help distinguish in sign language need not be 
the counterparts of sounds, as is evidenced by the preceding discussion. Thus, what has been 
identified as phonology appears to be conflated with morphology. Each modality imposes its 
own constraints on the development of the technology. The status of signed language as 
LANGUAGE is far from being diminished by the absence of an autonomous or extensive phonol- 
ogy component. It just makes it different in its architecture, though signed language shares so 
many other features with spoken language, especially regarding compositionality, “duality of 
patterning” (but in its modality-specific way), arbitrariness and symbolic nature of denotational 
signs (to different extents), displacement, etc. It has as much world-creating power in its narra- 
tives as speech, though there are certain things that one modality probably expresses more viv- 
idly than the other. 
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enables the speaker to “groom” (interpreted here as ‘socialize’) with several rather 
than with one other person at a time. (4) Speech can be amplified (as in screaming 
or shouting) to carry father more successfully than signing can, in order to be 
perceived from distance.'® 

On the other hand, signing has its advantages: (1) As noted above, it can be 
more transparent and informative in the description of some events (Goldin- 
Meadow 2003). (2) In the proximity of predators, a signer is safer than a speaker 
who would be giving away information about their location. The same is true of 
hunters, who would prefer to surprise the game rather than alerting it (John W. 
Wenzel, p.c., 1/24/2009; Mufwene, in press). Sometimes even whispering can be 
too loud. (3) Under water, it is certainly smarter to sign than to try to speak (John 
W. Wenzel, p.c., 1/24/2009). Evidently, we spend much more time out of and on 
water than under water. Like modern humans, our hominine ancestors must have 
socialized more frequently than they hunted; and they must have avoided living in 
the proximity of predators and other dangerous animals. So, ecological reasons 
must have stacked in favor of speech. (4) Like Swisher (1988), Mufwene (2009, in 
press) and Corballis (2010) speculate, in addition, that speaking uses less energy, 
as it depends on compact articulators that move in a much smaller space, and 
proceeds faster than signing. This is, however, an argument that holds only with 
regard to linearity. As explained above, signing compensates for this apparent 
shortcoming in not being absolutely linear, though bilingual people who speak 
and sign fluently are better placed to decide on the validity of this speculation. 


3. How did the linguistic technology emerge phylogenetically? 


3.1 ‘The transition from vocalizations to phonetic utterances 


The toughest part of this question has always been how speech, a discrete or digital 
communication system, could have evolved from holistic vocalizations of the kind 
observable in today’s nonhuman primates and certainly in our cries and laughter 
(Rousseau 1755). The question was poorly answered by such scholars as Carstairs- 
McCarthy (1999) and Wray (1998, 2000, 2002), as if the transition took place sud- 
denly from holistic vocalizations to modern phonetic inventories. This explains 
why their accounts have been disputed, especially by Johannson (2005), Tallerman 
(2007), and Bickerton (2007), though Carstairs-McCarthy’s and Wray’s answers 
are not totally groundless. Likewise, one must account for how signed language 


10. Signing can of course be amplified by technology outside the communicator’s anatomy, 
such as projections on a large screen. However, usage of such technology is a much later inven- 
tion not accessible to all nor on all occasions; it is therefore irrelevant to the present discussion! 
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could have evolved from gestures (McNeill 2005). The questions are now further 
complicated by the realization that the evolution to speech, which is intentional, 
must not have been that straightforward, because nonhuman vocalizations are no 
more intentional than ours (e.g., cries and laughter), which are innate and realized 
instinctively. I submit that ape-like vocalizations must first have been co-opted for 
intentional use before being modified and expanded into speech. 

In the case of speech, the question is whether the following position is plau- 
sible: as hypothesized by Rousseau (1755), the relevant hominine ancestors 
would have started to modify their vocalizations into distinctive, recurrent syl- 
lable-size chunks, which would have led to the gradual emergence of different 
vowels and consonants and would have set in place what MacNeilage calls “syl- 
labic variegation.” These would have evolved to the capacity to distinguish syl- 
lables from each other by some of the segments they consist of and not just by 
holistic syllable-size vocalizations. Both evolutions, concurrent or successive, 
would have resulted in the explosion of the vocabulary and the need to develop 
some syntax of sounds within words (what amounts to phonology), consistent 
with Hockett’s (1959: 38) speculation of why “duality of patterning” is a “design 
feature” of human languages. 

The contrary, “synthetic view” proposed by Johannson (2005), Bickerton 
(2007), and Tallerman (2007), according to which “protolanguage” would have 
started with words, which eventually would be combined into sentences thanks to 
the emergence of syntax,” must still tell us how the words, produced phonetically, 
started. It is necessary to account for the transition from prephonetic words to the 
phonetic words, which brings us back to the question addressed not so success- 
fully by Carstairs-McCarthy (1999) and Wray (1998, 2000, 2002). The latter’s posi- 
tion does not dismiss the fact that modern speech no longer functions like ape-like 
and humans’ most primitive vocalizations, as it serves more complex communica- 
tion. However, we must bear in mind that our prelinguistic hominine ancestors 
did not have the same mental capacity as modern Homo sapiens and that the evo- 
lution that innovated phonetic communication did not then necessarily call for 
the kind of complex communication modern humans engage in today. The evolu- 
tion that produced the late Homo sapiens also produced more complex minds that 
must have helped hominines evolve more complex phonetic communication, in- 
cluding various aspects of modern morphology and syntax, as I explain below. 
Hypotheses about hominine evolution based on paleontological evidence do not 
support hypotheses of an abrupt jump from ape-like vocalizations to even the kind 
of protolanguage proposed by Bickerton (1990ff.). 

The objections raised by Tallerman (2007) against Carstairs-McCarthy (1999) 
and Wray (1998, 2000, 2002) include the following: vocalizations are not produced 
by the same part of the brain as speech (p. 582); they are not intentional/volitional 
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(p. 583); they are produced as much by outbreath as inbreath among nonhuman 
primates, whereas they are produced only with outbreath among humans (p. 583; 
actually implosives use inbreath!); they are innate (p. 584); and they lack duality of 
patterning (p. 584). 

However, if every capacity that has been exapted for speech, in the form of 
domestication and extension of functions, had to remain the same, there would 
have been no evolution. Like any other aspect of evolution, something had to be 
co-opted from somewhere, including the intentional production of vocalizations 
and their gradual modification into digital sequences. It seems plausible to assume 
that the mechanisms involved in holistic vocalizations were exapted to do more, 
although intentionality must have been transferred from gestural behavior. 

To be sure, as suggested above, the proposed segmentation or “fractionation” 
need not have started with the gamut of phonetic distinctions now observable in 
modern languages. It need not have started with Consonant-Vowel sequences 
such as pa and ma, and certainly not with the complex hypothetical phonetic se- 
quences proposed by Wray (e.g., tebima and kumapi); and there was no reason 
why Tallerman should have been disappointed that Wray’s “holistic protolan- 
guage” would not have been able to handle many semantic distinctions. By defini- 
tion the protolanguage should not be expected to be systemically and semantically 
as rich as a modern language, especially in its initial stages. 

Chances are that the initial variegation that Wray (1998) wanted to illustrate 
(with the wrong examples!) started with vocalic distinctions within or between 
vocalizations. Just inserting a glottal stop (or pause) and modifying the vocalic 
quality would have sufficed to innovate an embryonic digital phonation system 
from continuous ape-like vocalizations. The evolution must have proceeded grad- 
ually, and undoubtedly by trial and error, in attempts to modulate the vocaliza- 
tions not only melodically (as nonhuman primates do) but also qualitatively 
(as humans do with vowels). Once the principle of vocalic variegation was in place, 
facilitated later on by the insertion of a consonant between the early vowels, the 
phonetic inventory would grow only proportionately to the communicative pres- 
sures experienced by the community of interactants. These pressures themselves 
were triggered by the increasing cognitive capacities of the relevant hominine an- 
cestors and by the complexification of their social structures." 


uu. [have no expertise on signed language and won't speculate in the dark about how the transi- 
tion from gestures to signs must have proceeded. We simply must remember that intentional 
communication appears to have started with gestures earlier than with vocalizations. On the 
other hand, as argued above, the architecture of signed language need not be patterned on that of 
speech. Thus the question of transition might not have to be addressed in exactly the same way. 
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3.2 The complexification of spoken language 


As explained in Mufwene (in press), the phonetic inventories of the emergent lan- 
guages must have expanded in response to communicative pressures for more and 
more lexical distinctions. It may have started with contrastive monosyllabic utter- 
ances, which may or may not have been symbolic yet. However, the relevant 
hominine ancestors would have realized soon that more signs could be produced 
without having to expand the phonetic inventory too extensively if words con- 
sisted of more syllables, in alternating and/or (partly) reduplicating sequences. 
Although many modern languages have managed well with large proportions of 
monosyllabic words, especially with the help of phonemic tones (which account 
for semantic distinctions between segmentally identical words), it is equally note- 
worthy that several, if not most languages, rely heavily on words longer than one 
syllable. This is especially significant in languages such as Hawaiian with a small 
segmental inventory (10 vowels and 8 consonants) that are also non-tonal. Varia- 
tion in the ways that the phonetic sounds have been used in the architecture of 
different languages, both regarding their specific phonetic inventories and the re- 
lated combinatorial constraints, underscores the status of languages as cultural 
artifacts, notwithstanding the significance of humans’ biological predisposition 
for this technology. 

Whenever polysyllabic words emerged phylogenetically, I surmise that, just 
like children learning the language of their social environment, the hominines that 
evolved speech did not first spend their time developing a full phonetic inventory 
and wait until it was in place to start producing words. Johannson (2005), Bickerton 
(2007), and Tallerman (2007) are correct in drawing attention to the centrality of 
words in the phylogenetic emergence of language. It is more likely that, as in child 
language development, the phonetic inventories and the vocabularies grew con- 
currently, as it became necessary to use more and more units to coin more and 
more (minimally) contrastive words. This concurrent evolution must have pro- 
ceeded until the phonetics reached its critical mass and speakers thought that the 
extant inventories offered them relatively limitless possibilities to produce the new 
words they needed. 

We must bear in mind that there is no standard size that satisfies all popula- 
tions/cultures; different language communities have managed equally well with 
fewer or more segments than others, resorting to combinatorial constraints that 
are not identical crosslinguistically, although there are some universals both re- 
garding some segments (notably the cardinal vowels /i, u, a/ and voiceless bilabial 
and coronal oral and nasal stops), as well as particular combinatorics (including 
CV and NC) that are attested in almost all languages. It is informative that, despite 
world-wide variation in the contents and sizes of phonetic inventories, most 
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languages cluster in the average middle (Maddieson 2006). Given the large size of 
possible sounds that humans can produce, it is striking that they avoid more or 
less the same statistically marked ones and that almost the same sounds constitute 
the cores of their inventories. There must be a number of ecological constraints 
stemming from the articulatory and perceptual anatomies involved in the produc- 
tion and processing of phonetic sounds that account for this state of affairs. Pho- 
neticians are better placed than most of us to explain or investigate them. 

As the size of vocabularies expanded, it must have appeared useful to capture 
formally the semantic relations that obtained between some lexical items in multi- 
word utterances, i.e., once communication exceeded the one-word utterance stage 
consisting only of naming and pointing. I am not sure how long the putative holo- 
phrastic stage (would have) lasted, because it is difficult to believe that a hominine 
population would have evolved a rudimentary language just to name entities, and 
maybe also feelings and conditions, without providing more information. More 
plausible is the hypothesis of assuming some sort of telegraphic stage perhaps 
close enough to Bickerton’s (1990ff.) “protolanguage” but without all the stipula- 
tions he has invoked to distinguish it from modern languages. In its initial stage, it 
need not have been as complex as a pidgin, produced indeed by modern humans 
who had already spoken full-fledged languages (if only there were a universal 
grammar of pidgins at all!). The reason is that, as noted above, our hominine an- 
cestors did not yet have our modern kind of mind, had not yet developed modern 
kinds of social cultures, and therefore were not under the same ecological pres- 
sures for modern style communication, even in pidgin form or child language in 
its second year, pace Bickerton (1990ff.) and Tallerman (2007). 

Word-formation strategies (themselves also controlled by the relevant emer- 
gent combinatorial conventions) must have also started to emerge as alternative, 
phonetically economical and mnemonically efficient ways of expanding the vo- 
cabulary. Both affixation and compounding may have started that early (though 
practitioners of grammaticization may favor compounding), bearing in mind 
that subsequent history would not always maintain transparency. These strategies 
provide the benefit of facilitating both the formation and the recognition of new 
words, by analogy to previous complex forms, distributing the tasks between 


12. According to Pollick & de Vaal (2007: 8188, col. 1), an embryonic form of predication is 
attested in bonobos, whose “vocal communication (...) appears more dialogue-like, and includes 
soft peeps to draw attention to and ‘comment’ on novel items and environmental events (...), a 
characteristic shared with early human language development.” Unless this characteristic of the 
bonobos is homologous evolution, it may mean that the ability for predication may have started 
in an embryonic form long before hominines evolved language. We must remember that there 
has always been a predisposition for communication among all social animals. It is how homi- 
nines diverged from all the others in our evolution that makes all the difference. 
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memorization and creation. Though the size of the vocabulary used collectively 
by a language community can become awfully large (such as the over 100,000 
words commonly associated with English), a fluent speaker need not know all of 
them, as long as he/she knows the most common vocabulary and derivational 
morphemes and can therefore form or interpret some lexical items hitherto un- 
known to him/her from these materials. Thus, many innovations by different 
speakers are bound to be the same, as they are constrained by similar combinato- 
rial principles within the same community. Each fluent speaker/signer learns 
how to package concepts new to them either by exapting extant words (in the 
right pragmatic context) or by coining new ones from combinations of familiar 
lexical and derivational materials. 

From the point of view of the vocabulary, languages are ingenious technolo- 
gies that are so adaptive to new communicative needs and must be user-friendly.'? 
All users of the technology will of course not do things exactly the same way, espe- 
cially when there is choice between afhixation and compounding in the same lan- 
guage or even between different affixation strategies (pre-, in-, and suffixing), 
which generates overlapping and competing strategies and complicates the task 
for the analyst, but not necessarily for the speakers/signers, who are not in the least 
interested in figuring out what the overall architecture of their language is or will 
be like. Languages have definitely not emerged by design; Hockett (1959) was cer- 
tainly mistaken in speaking of “design features.” Linguistic “systems” illustrate 
“emergent systems,’ which makes it easier to explain why languages change over 
time, under the pressure of various ecological factors (Mufwene 2001, 2008).!4 

The expansion of the vocabulary appears to have been a consequence of the 
increase in hominines’ cognitive capacity, as pressure increased to name more 
entities (persons, animals, and objects), activities, and conditions. The same 
expansion of the cognitive capacity must also have driven the need to comment 
on particular entities, activities, and conditions. This development marked the 


13. Perhaps languages as technologies can be compared with each other according to how user- 
friendly they are, though this is also a function of how a particular user is more accustomed to 
doing particular things. One may also argue that, since variation is a fundamental characteristic 
of human populations both within and in relation to each other, this kind of inquiry is not par- 
ticularly fruitful. No population ever aimed at producing an optimal system, despite the preoc- 
cupation of some linguists with optimality. It’s all a matter of satisficing behavior, in which the 
interactants are happy with satisfying some but not necessarily all conditions for the use of 
(new) terms or the extension of the usage of extant ones. Adjustments are made ad hoc, often 
based on analogies, without anticipation of ultimate consequences for the system. 


14. Practitioners of usage-based grammar, such as Croft (2001) and Tomasello (2003), lead 
to the same kind of position as expressed here. For them grammar is the cumulation of pat- 
terns arising from the ways utterances are produced and have no ontological precedence to 
the latter. 
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emergence of predication strategies in languages, starting with the functional 
distinction between, on the one hand, susyecr or Topic and, on the other, 
PREDICATE. If the vocabulary had had no lexical categories until then, a distinction 
would have had to start at this point, with Nouns specializing for the function of 
ARGUMENT and VERBS for that of pPREDICATE.!° This hypothesis is consistent with 
Stokoe's (2001: 44) observation on the interdependence of lexical categorization 
and sentences: “the words in the lexicon must already be sorted by kind, and that 
sorting had to come from their roles in sentences.” However, he thinks that the 
distinction predates language and must have been in place in Homo erectus, when 
much of the communication was still gestural, a position apparently shared by 
Pollick & de Vaal (2007), as shown in note 13. 

It’s not evident yet that the category ADJECTIVE, which can also be used pred- 
icatively, would have started yet (in languages that have it, as it is not universal). 
Chances are that it would not have emerged before pressure mounted to articulate 
more explicitly or specifically how a noun picks out its referent out of a wide range 
of denotata. Once they emerged, adjectives would specialize for modifying nouns 
but would also be candidates for predicating states or conditions about the subject. 
Languages would then vary typologically regarding whether a copula is required 
when adjectives are used predicatively. 

Otherwise, predication may have started simply with utterances in which a 
lexical item is used as an argument and another as a predicate. The emergence of 
the lexical distinction between NoUN and vERB would facilitate a response to an- 
other need for a more informative communication system, viz., to specify reference 
for the subject/topic and time reference for the predicate, though one is reminded 
now of languages such as Mandarin in which aspEcrT is expressed grammatically 
but not TENSE. Modern languages suggest that these two time notions need not 
have arisen concurrently in the emergent grammars, as the time at which a state of 


15. Givon (1979) is particularly informative about which kinds of entities tend to be nouns and 
which ones tend to be verbs, crosslinguistically, although there are nominalizations of activities 
(e.g., construction, endangerment) and of states and conditions (e.g., insomnia, fatigue, drowsi- 
ness). It may well be that such nominalizations are consequences of speaking about activities, 
states, and conditions as events or entities that are identified or identifiable as arguments of 
predicates in utterances. On the other hand, Evans & Levinson (2009: 434-435) draw our atten- 
tion to the fact that in languages such as Straits Salish the fundamental distinction remains 
PREDICATE VS. ARGUMENT, which is indicated syntactically by the affixation of function-specific 
markers to the same lexical items for ‘man, ‘big’ ‘run; etc. In the lexicon, there is no particular 
way to identify them as either nouns or verbs. For the sake of not complicating the exposition, I 
will leave this typological option alone, though it can be used to support the position that it is 
perhaps more accurate to speak of the evolution of languages, in the plural. From the perspec- 
tive developed in the present essay, communication problems were not solved in identical ways 
from one population/culture to another. 
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affairs occurred can be expressed with a time adverb such as ‘now, ‘yesterday, and 
‘tomorrow. Besides, modern languages also suggest that the categories need not 
have been structured identically in different communities. It is also likely that the 
relevant distinctions did not all emerge at the same time, though this raises the 
question of figuring out what the most fundamental distinctions are in different 
systems. 

As with word-formation strategies, specifying reference need not have started 
as an affixation process either; the strategy may have been periphrastic, as in lan- 
guages with isolating morphosyntax, though we have no evidence for speculating, 
like von Humboldt (1836) and Schleicher (1853), that the morphosyntax of lan- 
guages has evolved from isolating to fusional systems, going through agglutinating 
and polysynthetic systems. The evolution of languages from holistic vocalizations 
to polysyllabic utterances and from holophrastic to multi-word utterances need 
not have favored the earlier emergence of isolating morphosyntax, though this was 
one of the possibilities. It is noteworthy that a number of philologists up to the 
19th century thought that the earliest forms of speech involved agglutination, per- 
haps because, in elliptical constructions, they are reminiscent of holophrasis, as 
most of the relevant information is agglutinated to the verb, which is uttered as 
one phonetic word. 

One particular phenomenon that is relevant to the issue of the evolutionary 
order of morphosyntactic structure, especially for those who will not consider the 
option of alternative choices by different populations of speakers, is quantification 
and demonstratives as part of specifying reference with nouns. All languages of 
the world have these strategies, and they are executed periphrastically, whereas 
markers of nominal NUMBER and GENDER, the latter of which is an optional gram- 
matical category (in the sense that a language may choose not to include it) can be 
expressed either periphrastically or by affixation, displaying typological variation 
across languages. '© 


16. ‘This is a most interesting case of the emergence of typological diversity, as some languages 
have semantic gender (associated with denotational properties, as in English), whereas some 
others have (survivals of) formal or arbitrary gender (as in Latin and the Romance languages), 
while others have developed noun classes (with more numerous distinctions and originally also 
associated with denotational properties, as in the Bantu languages), and still others have nu- 
meral classifying systems (also with more numerous distinctions and even more transparently 
associated with denotational properties, as in Sinitic and Austronesian languages). But variation 
in this domain does not stop here, as languages within these types also vary according to the 
specific parameters along which the distinctions are articulated and the specific number of dis- 
tinctions made. While they also underscore the fact that languages are cultural artifacts, such 
patterns of crosslinguistic variation call for a clearer articulation of the concept of “parameter- 
setting” (as opposed to pattern-identification, as one may expect, from an emergentist 
perspective), in ontogenetic language development. 
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It is also noteworthy that even incipient pidgins, which are associated with 
reduced morphosyntax, preserve numerals and demonstratives, which suggests 
that, from a phylogenetic perspective, they are deeply entrenched in the cognitive 
substratum of modern human capacity to communicate linguistically and must 
have emerged early, just like the organization of the first telegraphic utterances 
into subject/topic and predicate. Demonstratives and quantifiers such as ‘little’ and 
‘much are also acquired early in child language, just like the words for ‘no’ and 
‘yes, which, according to Jackendoff (1999), must also have emerged early in the 
protracted phylogeny of hominine languages. We may thus infer that along with, 
or soon after, the emergence of predication, our hominine ancestors developed 
strategies to express quantity (including basic numerals) and learned to situate the 
referents of nouns used in utterances spatially with demonstratives. Markers of 
NUMBER, and GENDER, and CASE, which are easily dispensed with in pidgins, ap- 
pear to be useful accessories that may have developed later, as the communicative 
technology became more sophisticated.” 

The question of time reference appears to be more difficult. Creoles tell us that 
ASPECT must be distinguished from TENSE, and the former may be more critical 
than the latter, as is confirmed by Sinitic languages, for instance. The reason prob- 
ably is that, as noted above, deictic adverbs such as ‘now, ‘today’ ‘yesterday’, ‘to- 
morrow, and ‘later, which emerge early in ontogenetic language development and 
probably did in language phylogeny, can be used to situate states of affairs in time, 
just like demonstratives can situate entities spatially. It is not evident either when 
markers of MOOD may have evolved, to express the speaker’s/signer’s epistemic 
stand toward their utterances, thus making it possible to distinguish what is fac- 
tual from what is hypothetical or conditional, what is known from observation or 
direct experience from what is inferred or has been learned by hearsay, etc.. Though 
MOOD can be expressed periphrastically (albeit through modal verbs/predicates), 


17. The more involved reader will have noticed by now that gradualism is generally assumed in 
this paper, because, given the complexity of the architecture of modern human languages, any 
abrupt transition from no language or a protolanguage (Bickerton 1990ff.) to a language would 
have been too overwhelming to handle. It would have caused the collapse of the attempts to 
communicate explicitly and more efficiently, as the emergent technology would have been over- 
whelming and too difficult to adapt to. Although hominine evolution involved mutations, the 
ensuing population dis- and replacements did not occur in one swoop, as is evident from, for 
example, the protracted coexistence of the Neanderthal and Homo sapiens over tens of thou- 
sands of years. The genetic changes associated with evolution to modern languages needed to be 
reproduced and spread, gaining the critical mass for population-wide changes to take effect. It is 
also unlikely that the necessary communicative needs themselves all arose around the same 
time, because the complexity of hominine cognitive capacity must itself have evolved gradually. 
The architecture of language itself also suggests that the emergence of some structural features 
presupposed that of others before them, as I try to show in the rest of this essay. 
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combinations of verbs for this purpose (as also in the case of periphrastic aspec- 
tual constructions) presuppose the emergence of embedding, a strategy that en- 
ables a verb/predicate to function as the complement/argument of another. Is 
mood the kind of nuancing strategy in language that may have emerged later than 
the distinction between statements, questions, and orders/requests? The latter are 
fundamental aspects of social interaction and communication, which the linguis- 
tic technology was being developed to serve, for the purposes of facilitating coop- 
eration! I suspect that strategies for these illocutionary distinctions must have 
emerged early in communication, just as they also do in child language. They are 
not lost in pidgins, though the specific strategies used may not be those in cur- 
rency in the languages they evolved from, the so-called “lexifiers.” 

‘These considerations raise the important question of whether all aspects of 
predication started at the same time. What was said about predication above ap- 
plies without any obvious problem to intransitive, mono-argument utterances of 
the SUBJECT/TOPIC + PREDICATE kind. It says nothing about larger predicate 
phrases involving a second and possibly a third argument (as with the verbs fell, 
take, and put in English) and even adverbial adjuncts. Future research will have to 
address the question of whether the earliest cases of predication were restricted to 
SUBJECT/TOPIC + INTRANSITIVE PREDICATE constructions (expressing what the 
referent of the subject/topic is doing/experiencing or has done/experienced) or 
also involved constructions expressing a relation between two or more arguments 
in terms of what one is doing, or how one is related, to the other.!* Is the evolution 
from holophrastic utterances (simply naming entities and states of affairs) to those 
involving predication likely to have involved a shift to the predicate as the central 
element of an utterance, in which, when more than one argument is involved, it 
specifies what relation obtains between them? 

This may be a case where ontogenetic development may not be informative 
about phylogenetic development, because children are exposed to both intransi- 
tive and transitive constructions, aside from being born with a late Homo-sapiens 
mind that is predisposed to any linguistic distinction that is consistent with his/ 
her cognitive maturation. Our hominine ancestors need not have developed both 
types of constructions (transitive and intransitive) concurrently. We need different 
kinds of evidence to answer the question. 

Be this as it may, multiple-argument constructions would have called for strat- 
egies (argument order) and/or devices (case makers or adpositions) that identify 
the arguments’ thematic roles. Is it possible that, as suggested by Jackendoff (1999), 


18. Just in order to keep the exposition manageable, I will ignore typologically different, unac- 
cusative languages, as well as polysynthetic ones whose elliptical utterances may not even con- 
tain an independent, non-agglutinated subject or topic. 
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our hominine ancestors would have first relied on constituent order (such as in 
di-transitive constructions, in which the dative argument precedes the direct ob- 
ject)? The hypothesis is not implausible if one agrees with students of grammatici- 
zation such as Heine & Kuteva (2007) that inflections and other grammatical 
morphemes are erstwhile periphrastic markers, consistent with the alternative of 
the evolution of non-isolating morphosyntax discussed above.!? 

The emergence of predication obviously involved quite a number of details 
that our hominine ancestors had to attend to. Short of involving a miracle, it is not 
evident that the emergence of the morphosyntactic devices and strategies used in 
modern languages could have occurred in the abrupt, catastrophic style of evolu- 
tion hypothesized by Bickerton (1990ff.), endorsed by Tallerman (2007), and even 
still assumed by those such as Wray (2002) who offer alternative hypotheses. 

In any case, predication brought along with it the possibility of what Hockett 
(1959) identified as “displacement,” which enables interactants to exchange infor- 
mation not only about the hic et nunc of their interactions (which is more typical 
of holophrastic statements) but also about the past and the future, as about places 
other than where the speaker/signer is. Thus predication would have exerted a lot 
of cognitive pressure to specify time reference, in order to situate states of affairs in 
time. Assuming that modern languages’ specifiers of the verb are the counterparts 
of those associated with the noun (although modern generative syntax identifies 
them as heads of projections), the emergence of this capacity to situate states of 
affairs in time may have emerged concurrently with the capacity to specify where 
the referents are situated spatially (with demonstratives) and whether one or more 
than one is involved (with number markers), or even whether reference is generic 
rather than to particulars (with articles or some other devices). 

Generic reference for the noun may also have been a concomitant of the ex- 
pression of habits and typical conditions. Now, the question is whether, at some 
point in the evolutionary trajectory of language(s), there could have been an ex- 
plosive evolution in the hominine mental capacity that caused all these strategies 
to emerge concurrently in language(s), while allowing for typological variation 
from one to the other. Or may the evolution have been as gradual as in child lan- 
guage (Mufwene 1999), though the time between the emergences of the different 


19. I will leave alone the question of when adjuncts may have emerged, especially those that 
have nothing to do with situating states of affairs in place and time. The fact that they are op- 
tional suggests that they may have emerged later than the basic strategy of predication. On the 
other hand, it is also conceivable to distinguish markers of time (and place) from the other ad- 
juncts. Adverbial clauses are of a different order of complexity that must be discussed separately 
from manner and instrumental adverbs, for instance. They presuppose prior development of 
clause formation strategies, thus of predication, and of devices that connect adjuncts to the main 
clause. 
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strategies/devices may also have involved short phylogenetic distances? I am not 
sure that the question can be answered conclusively at this point. 

All languages make distinctions along the lines expressed above, despite the 
obvious typological variation that linguistics has articulated since the 19th century. 
They all also involve negation of the predicate phrase or constituents within the 
predicate phrase, which must have emerged early too, as is also suggested by child 
language development, though here too, there is typological variation regarding 
the position of the relevant marker in an utterance and whether, in the first place, 
the marker is the same for wide-scope and narrow-scope negation.”° I am inclined 
to consider typological variation as evidence for polygenesis in the phylogenetic 
emergence of language, since the geographical distribution of paleontological evi- 
dence, dispersed over the huge area of East(ern) Africa, defies the possibility of 
monogenesis. Besides, modern humans continue to show that there is usually more 
than one way of solving the same problem, notwithstanding the fact that no global 
typology has emerged that shows what particular parametric combinations are not 
permitted in a language. Greenberg (1966) identified only common tendencies 
and some implicational universals among the combinations that are attested. 

Before we proceed with the larger aspects of syntax, we should discuss a cou- 
ple of things about the morphosyntactic delimitation of the noun, viz., not all lan- 
guages have articles, no more than those who have them have a uniform way of 
articulating their distinctions morphosyntactically. (For instance, the functional 
distribution of the definite and zero articles is not identical in English and in 
French.) On the other hand, it appears that definite articles usually evolved from 
(distal) demonstratives and indefinite ones from the quantifier ‘one’ in many lan- 
guages. So, it may be assumed that grammaticization was involved in this case, 
which suggests later, optional emergence of these markers compared to quantifiers 
and demonstratives. These considerations too lead to the conclusion that the lin- 
guistic technology evolved gradually and incrementally, with different hominine 
populations innovating similar but not identical strategies at the same stages of 
cognitive evolution, a normal state of affairs in cultural evolution. Similarities in 
evolutionary trajectories do not necessarily reflect constraints exerted by UG. 
They may also reflect system-inherent limitations that offered no other alterna- 
tives but to exapt those materials that are the most adaptive for the new functions. 
The fact that similar grammaticization processes have been repeated in the histories 


20. The negation discussed here is different from what is the counterpart of ‘yes’ and can occur 
elliptically. As is evident from French, which has non and ne... pas as basic negation markers, the 
two kinds need not have the same form; nor need they have emerged around the same time. 
(French also uses pas alone for narrow-scope negation.) Child language development suggests 
that the independent negation is evolutionarily anterior to negation with scope over or within 
the predicate phrase. 
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of modern languages may reflect a bias that the modern problem-solving mind 
has undergone in human evolution. Perhaps this is the best evolutionary account 
there is for the concept of UG. The concurrent variation regarding quantifiers and 
demonstratives would have involved word order (notwithstanding the internal 
structure of the domains), about which nothing has been said so far.”! 

Whether the emergent grammar opted for SVO, SOV, or some other kind of 
major-constituent order in the sentence, and for head first or head last in the ar- 
rangement of constituents within the noun phrase appears to be as inconsequen- 
tial to the emergent technology as the design of cars with the steering wheel on the 
left or the right side of a four-wheel vehicle, though there are secondary conse- 
quences, such as the location of the ignition key and dashboard commands. 

The same variation also emerged regarding whether or not a language would 
resort to case markers, postpositions, or just word order to express thematic rela- 
tions in an utterance.”* However, all languages have developed markers of rela- 
tions between some constituents, to express joint participation (conjunctions), 
accompaniment (comitative relations), and instrumental, manner, and temporal 
adjuncts to the core predicate phrase, among others. Since the direction and orig- 
ination of motion is, more often than not, not incorporated in the meaning of the 
verb, languages usually include adpositions and/or special case markers to mark 
these verb complements. The relevant systems are sometimes elaborate in some 
languages, such as to, into, onto, out of, and from, in English. In some languages, 
they are not, especially if the direction and origination of motion are incorporated 
in the relevant verbs. Some languages have opted for redundancy, expressing with 
the adposition what is incorporated in the meaning of the verb, such as extract NP 
from and import NP (in)to. 

Also, as Heine & Kuteva (2007) show, the history of many languages suggests 
that many adpositions, if not all of them, have evolved from erstwhile nouns or 
verbs (the case of prepositions such as down, inside, behind, while, including, dur- 
ing, except(ing), and regarding in English). Thus, they are apparently later develop- 
ments, by exaptation of extant materials, than the fundamental categories of nouns 


21. An interesting analog to the emergence of typological variation in the evolution of lan- 
guages, assuming polygenesis, is the evolution of desktop computers, with the Mac and PC en- 
gineers developing similar but not identical technologies at more or less the same times with 
different degrees of success with certain tasks. Despite differences, they satisfy the needs of their 
customers, in their own respective cultures, and they do not fail in their most basic tasks. Each 
technology is also ready to adapt to their customers’ new needs. 


22. Escure (2009) is perhaps the latest creolist discussing serial verb constructions to point out 
that serial verbs often express thematic relations that some languages express only with adposi- 
tions. Serial verb constructions are indeed an important typological option for sentence expan- 
sion, to which I return below. 
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and verbs. How does their emergence bear on that of predication? Can we assume 
that, like many other aspects of language, the internal structure of the predicate 
phrase evolved incrementally? At what stage of language phylogeny are the syntac- 
tic structures requiring adpositions likely to have emerged? May those marking 
complements of motion verbs, or transfer of possession for some verbs (like take 
and give), have emerged earlier than those marking adjuncts? 

It seems to me that the question is really that of when these devices emerged in 
language, rather than that of whether they emerged later than the categories of nouns 
and verbs. On the other hand, we cannot overlook the fact that these functional 
categories (prepositions and conjunctions) are not as extensive in some languages 
than others, in which combinations of nouns and a connective functionally similar 
to of in English are used to express the same grammatical relations (in fact as in in 
front/back of, on top of, aside from, except for, and along with in English itself).”3 

These facts suggest that perhaps just a small core of “all-purpose” connectives 
were used, aside from verbs, to mark grammatical relations in the early stages of 
the phylogenetic emergence of language. The derivative prepositions and conjunc- 
tions that have extended the inventories of functional categories must have 
emerged later, apparently in order to make communication either more precise or 
less vague, depending on one’s idea of what precise communication is. Note that, 
even during this particular phase of the evolution of language, there is also room 
for typological variation even within the same language. While there are many 
cases where new unitary prepositions have been grammaticized from nouns and 
verbs, there are also many cases of composite prepositions. Some languages have 
chosen this particular option. French has a few, such as au sujet de ‘about, autour 
de ‘around, en provenance de ‘from, and au dessus de ‘on top of, above’. The trans- 
parency of the prepositional phrases (not preposition phrase as P + NP) is as strik- 
ing as that of the semi-auxiliary be going to (+ infinitive) in English or that of 
prepositional phrases such as on top of, in back of, in front of. Again the evidence 
speaks for a gradual, perhaps protracted evolution of language toward more and 
more productive and increasingly complex syntax, reflecting how extant materials 
were coopted toward the production of new expressions. As suggested above, we 
have no reason for assuming that at the initial stages of the emergence of language 
our hominine ancestors’ minds and their communicative needs were just like ours 


23. This kind of vocabulary expansion is not restricted to prepositions only. It is also attested in 
the category of quantifiers, with composite expressions such as a lot of, a number of, an amount 
of, a multitude of, and a bunch of in English. That their lexical status is somewhat different from 
quantifiers such as many, several, and few is made evident by the fact that they are intensified 
differently, for instance, a whole lot of, a (very) large number of, and a certain amount of, as op- 
posed to very many and very few, with the latter behaving like adjectives and the former like 
nominal phrases. 
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today. Consistent with Darwinian evolutionary theory, the gradual complexifica- 
tion of the language technology suggested by the data may reflect the gradual com- 
plexification of the hominine cognitive capacity and need for more sophisticated 
communication. 

The gradual complexification of the grammar is also evident in the use of a 
clause as the complement ofa higher predicate, as the modifier ofa noun (e.g., relative 
clauses), or as an adjunct adverbial clause. Made possible by the principle of 
recursion,” this expansion of the sentence also led to the emergence of comple- 
mentizers, among other alternative devices (such as the conversion of the verb into 
a gerund, i.e., verbal noun), in systems where particular subordination strategies 
emerged (as in Western European languages), as opposed to verb serialization.”* 

Givon (2009) hypothesizes that subordination evolved from serialization. A 
problem with this hypothesis is that we have no evidence that all subordinators 
evolved from verbs, although there are some languages where the serial verb ‘say’ 
has evolved to function as a complementizer. Although this does not rule out the 
possibility that the first instances of clause-embedding involved no subordinators, 
it is also noteworthy that some serial verbs express thematic relations that are 


24. Although recursion has properly been associated with the repetition of particular combina- 
torial patterns in syntax, typically with complement and modifying clauses and with multiple 
embeddings of preposition and noun phrases, one may also argue that it is deeply rooted in the 
strategy of compositionality itself, which allows longer and longer expressions. Originally sug- 
gested by Martinet’s (1960) principle of “double articulation” and Hockett’s (1959) and Hockett 
& Ascher’s (1964) “duality of patterning,” compositionality applies in more than one module of 
the architecture of language, viz., in combinations of sounds into morphemes and words, as well 
as in combinations of the latter into sentences, eventually in combinations of the latter into 
larger discourse. Syntax has focused on a specific kind of it, whereby some patterns for expand- 
ing constituents are repeated within themselves to produce even larger phrases. We also know 
from modern grammars that constraints emerged that restrict particular combinations and 
various syntactic processes from occurring under particular conditions. The syntactic con- 
straints may have a cognitive basis, while the phonetic ones may have acoustic and/or articula- 
tory motivations. We will not deal with them in this essay. 


25. We must bear in mind that the term serial verb construction was coined in reference to the 
fact that in languages producing such constructions verbs or verb phrases are sequenced with- 
out conjunctions or subordinating morphemes between them. The syntactic relations between 
the serialized verbs are not uniform: the verbs may be considered as conjoined in some con- 
structions, whereas in some others the serial verb is the object of the head verb, as in the Gullah 
construction Uh tell um come home ‘I told him/her [to] come home. The mischaracterization of 
some serialized verbs as “instrumental” and “dative” also reflects the fact that their semantics are 
not uniform from one construction to another. As argued in Mufwene (1989, 1990), serializa- 
tion is just an alternative strategy, in opposition to morphosyntactic subordination and conjunc- 
tion (with overt markers for specific syntactic functions), for the formation of complex sentences. 
See also Evans & Levinson (2009: 442) for a similar position. 
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expressed with adpositions in some other languages. This suggests that serial verb 
constructions are not simply the counterparts of subordinate clauses. One might 
otherwise argue that coordinate clauses and preposition phrases all evolved from 
serial verb constructions. The evidence for the position is just not compelling; it is 
typically based on idiomatic translations of serial verb constructions into English 
or French. The typological evidence simply suggests that different populations 
have chosen to organize their grammatical systems differently, consistent with the 
earlier observation that the human mind is capable of producing alternative solu- 
tions to the same problems. This is a central point in Evans & Levinson’s (2009) 
argument for not subordinating linguistic diversity to language universals. From 
the perspective of the present essay, the relevant problem is how to pack various 
information hierarchically or otherwise in a sentence. 

A less controversial hypothesis is that subordinators emerged after the appli- 
cation of recursion to clause-size units made it possible to expand a clause with 
another clause that functions as a complement of a verb, as the modifier of a noun, 
or as an adverbial adjunct. Languages vary regarding the strategies they evolved to 
mark such clauses. More research is needed regarding what structures had to be in 
place before others that possibly coopted them emerged. This will then help us 
determine with some certainty at what particular phase of the phylogeny of lan- 
guages some devices and structures that do not appear to be fundamental evolved. 
The fact that the primary characteristic of serial verb constructions is not to have 
any function words between the sequenced verbs and for the verbs to be unin- 
flected may help us sort out what are some of the non-fundamental devices in the 
architecture of modern languages. Candidates include not only complementizers 
but also the distinction between finite and nonfinite forms of verbs, as well the 
nominalization of the subordinate construction as done in Japanese and Korean 
before a postposition is attached to it to mark its syntactic function.”® 

The above observation is not intended to deny the usefulness of complemen- 
tizers and other subordinators in facilitating sentence-processing. It simply high- 
lights the fact that some devices in the architecture of languages may be treated as 
accessories that enhance how they function but are not essential or critical parts 
(Mufwene 1999). This could also have been said of the copula, gender, nominal 
number, and agreement strategies (between an overt subject and its verbal 


26. The way complement clauses in Japanese and Korean are bounded morphologically with a 
nominalizer followed by a postposition suggests in fact that speakers had to realize at some 
point that the noun or noun phrase could designate not only an entity (material or abstract) but 
even a state of affairs. What Japanese and Korean do so transparently is reminding the hearer 
that the complement clause must be interpreted as a noun phrase. We can thus be certain that 
the capacity for a verb to have a nominal object provided the substrate for the emergence of 
complement clauses. 
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predicate and between a noun and its adjectival modifier), the kinds of things that 
pidgins dispense with (not inevitably!) and whose functions children understand 
later during child language development. 

Recursion has also enabled the modification of a noun by another noun as in 
the English my teacher’ book’ cover and its French translation la couverture du 
livre de mon professeur, as well as the modification of a preposition phrase by an- 
other preposition phrase as in the wine in the glass on the table by the window. 
Whenever these kinds of constructions emerged during the phylogenetic emer- 
gence of language(s) (indeed when English had not emerged yet!), they presup- 
pose the earlier emergence of the simpler constructions that could be coopted into 
the production of the larger ones.’” Exaptation was thus a key strategy in the phy- 
logenetic emergence of language(s). 


4. Conclusions 


The primary goal of this essay was to show how languages can be thought of as 
emergent technology whose function is to enable sophisticated and more explicit 
communication than, as far as we know, nonhuman primates’ vocalizations and 
gestures. Though it is more fashionable to think of writing as technology, that of 
putting on material objects (clay, papyrus, paper, and now the computer screen) 
what would otherwise be spoken, language is itself technology developed through 
the domestication by hominines of their own anatomy to express their thoughts 
and feelings, to describe various states of affairs around them, to relate past experi- 
ences, to plan future states of affairs, and, as claimed by several students of the 
evolution of language, to cooperate toward the sustenance of their societies. 

The second goal was to speculate on how this technology evolved, albeit in an 
unplanned way, by exaptations by hominines of extant structures, in response to 


27. ‘The reader should remember that the emergence of modern languages such as English and 
French did not follow the same evolutionary trajectory discussed here. They reflect historical 
change from ancestor languages and speciation under particular contact conditions, although 
some of the processes involved in the phylogenetic emergence of languages have been repeated, 
such as grammaticization. Some typological shifts have also taken place during the relevant 
changes such as from VSO to SVO in English. All this is only part of the complex story showing 
that the linguistic technology continues to be adapted to its users’ communicative needs under 
particular ecological pressures, including language contact (Mufwene 2009). This all means that 
what language history can tell us about the phylogenetic emergence of language may be very 
limited, especially since historical changes have not always proceeded from less to more com- 
plex structures. Often, just the opposite has been true, such as in the morphosyntax of Indo- 
European languages such as English and the Romance languages, from the point of view of 
distinctions within particular domains or of constraints on the application of particular rules. 
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growing communication pressures arising from their expanding mental and cog- 
nitive capacities. It seems obvious that the technology started with the capacity to 
name entities and states of affairs, with the phonetic inventories increasing up to a 
critical mass (variable from one population to another) to satisfy the continuous 
expansion of the vocabulary. This evolution already set hominines apart from 
nonhuman primates, whose vocabulary of calls is considered limited to innate 
needs and does not expand. The next step was the emergence of predication and 
strategies to specify reference (for both nouns and verbs”®), a capacity that linguis- 
tic apes have hardly acquired (at least not to the extent observable in modern hu- 
man languages), which underscores the evolution of a mind particularly hominine 
as a critical evolutionary prerequisite to the emergence of the linguistic technolo- 
gy. So what sets us apart from other animals is not so much language as the mind 
that made it all possible and also accounts for why our cognition and cultures are 
also so different from those of other animals. 

Past these initial stages, the story of the phylogenetic trajectory of language 
has been more difficult to construct, defying any unilinear and perhaps also rec- 
tilinear account. My admittedly unfinished story shows how partial accounts such 
as Jackendoff’s (1999), Heine & Kuteva’s (2007), and Givon’s (2009) are. My spec- 
ulations agree only partially with theirs, certainly in showing that the evolution 
was incremental and that some strategies and devices presuppose prior emer- 
gence of some others that served as their substrates. This conclusion underscores 
the need to better understand the ways in which the different aspects of the archi- 
tecture of language are interconnected. We must sort out what particular 
components have evolved concurrently and which ones presuppose the earlier 
emergence of others. We must also think hard over what counts as evidence for 
our hypotheses. 

I have also highlighted the relevance of typological variation to the question 
of whether we should conjecture monogenesis or polygenesis in the evolution of 
languages. Taking into account the geographical distribution of paleontological 
evidence across East(ern) Africa, it is difficult to dismiss the possibility of poly- 
genesis as what made typological variation possible. Different hominine popula- 
tions need not have solved their communication problems in identical ways, 
though they were developing technologies that co-opted the same anatomical 
structures to produce speech or manual signs. Uniformity in the phylogenetic 
emergence of languages seems to end with the choice of the same materials in the 
development of the technology and the common, universal principles that apply 


28. I am aware of the tradition that associates reference only with nouns, but linguists also 
speak of “time reference.” It is both meanings that are intended here. 
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because they may be consequences of both the materials used and the mind pro- 
ducing the technology.”? 

These considerations suggest that we could as well be speaking of the evolu- 
tion of languages, indeed in the plural, with emphasis on diversity, at least 
sometimes, rather than just speaking of the emergence of language, in a non- 
individuated and unified way. If the commonalities are largely consequences of 
both the materials used to produce the technology and the mind driving the emer- 
gence of the technology, then scholars such as MacNeilage (2008), Koster (2009), 
Lieberman (2010), Jackendoff (2010), and others are justified in questioning the 
postulation of a language organ or Universal Grammar as THE central factor ac- 
counting for how languages have evolved and for their common properties. If any- 
thing, the name language organ itself must be a misnomer, especially because it 
corresponds to no anatomical part of hominine structure, nor to any combination 
of organs specializing just for language.*° Universal Grammar, as the sum of men- 
tal specifications and/or constraints putatively determining the properties shared 
by all languages, would be a consequence of the evolution I have attempted to ac- 
count for here. The biological foundation of languages, more accurately interpre- 
table as cultural artifacts, appears to lie in the way that biological evolution 
endowed hominines with a particular evolving mind capable of producing the 
sophisticated modern communication technologies we now produce and use. Be- 
cause of other cultural artifacts and conventions that only humans produce and 
use today that other animals do not, I submit that what distinguishes humans from 
other animals is not so much language as the mind that made it all possible. Of the 
lessons I have learned from this writing this essay, perhaps the most significant is 
how complex the subject matter of the phylogenetic emergence of language(s) is 
and what a particularly rich perspective one gains from thinking of language(s) as 


29. Evans & Levinson (2009) are among those that are reluctant to invoke UG (under whatever 
name) to account for the common properties, especially constraints, shared by most, if not all, 
languages. They include a relatively detailed discussion on p. 439 that I will not try to summarize 
here. 


30. David Lightfoot, who has perhaps written the most on the subject matter, is not particu- 
larly informative. To wit, Lightfoot (2005) characterizes the “language organ” variously as “bio- 
logical? “psychological,” “bodily, and yet not anatomical (in that it cannot be localized in the 
brain), quite unaware of the confusion his inconsistent characterization produces. He also ar- 
gues that the “linguistic capacity [or grammar, also identified with the “language organ”] is 
better understood in functional rather than anatomical terms” (57) and later compares the de- 
velopment of grammar in the child to the growth of hair (58), subject to particular ecological 
conditions, the “trigger” This is at best an elusive characterization of UG that can hardly be op- 
erationalized to explain either how language evolved or how specific languages are learned by 
their speakers. 
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technology, and especially how many questions arise, the answers to which are not 
obvious. 
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